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Lecture 1 


1: NON-ABELIAN GAUGE THEORIES 


In 8.323, we have studied the quantum theory of: 


‘L Scalar fields, spin-0, 
2. Dirac fields, spin-, 
3. Maxwell fields, spin-1. 


The last two fields are the content of quantum electrodynamics, an extremely successful description of the electro- 
magnetic interaction observed in nature. For the weak and strong nuclear interactions, however, the fields listed 
above are not sufficient. We require non-Abelian gauge theories. In this course, we will describe the formulation and 
quantization of these fields. In 8.325, we will learn how to use them to describe the weak and strong interactions. 


1.1: SYMMETRIES, LIE GROUPS AND LIE ALGEBRAS 


Noether’s theorem reveals a correspondence between symmetries and conserved quantities: 


Symmetry <— Conserved quantity, 
Spatial translation + Momentum, 
Temporal translation ~— Energy, 
Rotation <— Angular momentum, 


Internal symmetries +— Charges. 


Example 1: Dirac spinor field 
2 = —ith(y"O, — my. (1) 
Note that spinor indices will always be suppressed, and that we use the (— + ++) metric convention. This 


Lagrangian is invariant under the internal symmetry transformation w(t, #) —> e'°a(t, Z). The conserved charge 
associated with this symmetry is Q = f d?x yy. 


Remarks: 

1. All possible phase multiplications e’* form a group, called U(1). U(1) is Abelian, that is, e*1e’°? = 
e’2e'%1_ A Group is a set closed under a multiplication, satisfying the axioms of (i) associativity, (ii) 
existence of an identity and (iii) existence of an inverse. 

2. U(1) is a continuous group. U(1) = S', the unit circle, as a manifold. A Lie Group is a group that is 


also a differential manifold. 
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Figure 1: U(1) as the manifold St with the elements 1 and e’* shown explicitly. 


Example 2: Two Dirac fields 
L = —ih (yO, — mi) — iby (YOu — m2)y2. 
This Lagrangian is invariant under the internal symmetry transformations 
U(1) : d(x) — eg (a), UA) : bo(x) — e*? Y9(2). 


The group formed by this set of transformations is U(1) x U(1), with distinct charges of w and wy. If 
wi 


can then rewrite (2) as 


L = -iV (yd, — mV, 
which is invariant under the internal symmetry transformation 


U(2) : U(a) —+ UW(a), V(x) —> G(x)UT," 
where U is a 2 x 2 matrix such that 
UUT =1. 


The complex matrices satisfying this condition form the group U(2). We can separate out an overall phase 
rotation in these transformations: 


U(x) — el U(x) 


where U = e!* =e’“lg,.2. We can constrain the residual transformation U’ to satisfy 


det U’ = 1. 


(3) 


my, = m2 =m, there is a larger symmetry, since we can mix w, and w2. Introduce VW = es i v= (o, 2). We 


(4) 


(8) 


The set of 2 x 2 matrices satisfying (6) and (8) form the group SU(2). In summary, for m, = ma, (2) is invariant 


under 
U(2) = SU(2) x U(1) 


where U(1) is proportional to the identity matrix and commutes with SU(2). 


Remarks: 


(9) 


1. SU(2) is a non-Abelian group. U;U2 4 U2U 31. It has three continuous parameters (diffeomorphic as a 


manifold to S®). 
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2. Any SU(2) group element can be written as U = exp [ida], with A, constant, a =1, 2, 3. oq here 
are the Pauli matrices, and % are the generators of SU(2), satisfying the associated su(2) Lie algebra 


commutation relations, 


Oa Ob 1 Oc 
—,—| = le€ghe—- 1 
| 2° 2 | ae cat) 
3. For infinitesimal A,, U =1+iA, 4 +..., we have that dU = iA, W, and by Noether’s prescription, we 


find an associated set of conserved charges Qa, (a = 1,2,3), which satisfy the commutation relations 
(Qa; Qo] = t€abcQe (see problem set). That is, the conserved charges satisfy the Lie algebra commutation 
relations. 


4. Starting with n Dirac fields of the same mass, we find the symmetry group U(n) = SU(n) x U(1), where 
U(n) is the set of n x n matrices under matrix multiplication that satisfy UU! = UTU = 1. SU(n) and 
U(n) are examples of Lie groups. 


1.1.2. More information on Lie groups and Lie algebras 


(for a more detailed treatment, the reader is referred to the notes by B. Zweibach, Peskin and Schroeder, §15.4, and 
Weinberg, Volume I, p53-55, Volume II, §15.1-15.2.): 


a. b. C. 


Sao 


Figure 2: a. U(1), the manifold $1. b. SU(2), the manifold $*% (one dimension suppressed). c. An arbitrary 
manifold with a point A, labelled. 


Lie group: a continuous group that is also a differential manifold. Examples include U(1), which is the manifold 
$1, and SU(2), which is the manifold $3. The group elements are labelled g(A,) € G, where A, are parameters 
labelling points of the manifold, a = 1,2,...,dimG. They satisfy a group multiplication: 


g(A) 0 g(A’) = g( A"), (11) 


where 
= fh A): (12) 


Lie groups are tailor-made for describing continuous symmetry transformations in physics: the group product 
corresponds to the composition of transformations. The group product satisfies the properties of (i) closure, (ii) 
associativity, (iii) existence of an inverse element, and (iv) existence of an identity element. Now consider a system 
described by some Lagrangian, #. All physical symmetric transformations form Lie groups. Examples include: 


R''3- space-time translations, 
SO(1,3) - Lorentz boosts and rotations, 


U(n) - internal symmetry groups. 


A compact Lie group is a manifold of finite volume. 
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Figure 3: A Lie algebra as the elements in the infinitesimal vicinity of the group identity element. 


Lie algebra: as physicists, we are often interested in infinitesimal transformations, in order to find the conserved 
currents. As mathematicians, describing the general structure of a Lie group is difficult. The trick is to consider 
elements of G that differ infinitesimally from the identity element, from which one may build up the entire group. 
Choose Aq so that g(Ag = 0) = 1. Then 


glég) = 1+ tegT* + O(e*), (13) 


where a = 1,...,dimG. It is a physicist’s convention to extract a factor of 2 explicitly in the expansion, so that the 

generators are hermitian for unitary group transformations. €, parameterizes the tangent space around g = 1, with 

T“ a basis for the tangent space. Mathematically, egT” € T,G. Putting (12) into (13), we find T® should satisfy 
Pr aare! dad es ame Aa (14) 


where [re Te =T%o0T’—T° oT, defining the Lie algebra g associated with the Lie group G. The f°? are the 
structure constants of the Lie algebra and are necessarily anti-symmetric in a,b. There is another constraint on 
the structure constants: as the group multiplication, 0, is associative, Ee? T °] satisfies the Jacobi identity: 


[E95 | Peers | a eer el 0, (15) 


Hence, 
fee Se Nisa a ats peepee —(. (16) 
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Figure 1: An element A of the manifold of the Lie group G, and the Lie algebra g as the tangent space of the 
identity element. 


Some facts about Lie groups and Lie algebras: 


1. 


Different Lie groups can have the same Lie algebra. The Lie algebra determines the Lie group up to 
discrete choices of global structure. For example, SU(2) = 9°, SO(3) = S?/Zo. 


An invariant subalgebra is a subset of a Lie algebra g’ C g which is closed under the action of g. 
That is, [g, 9’] C g’. A simple Lie algebra is a Lie algebra which does not contain invariant subalgebras 
and which is not Abelian. The complex simple Lie algebras are completely classified: su(n), so(2n), 
s0(2n +1), sp(n), E6.7,3, F4 and G2 are the only possibilities. 


For a compact Lie group, it is always possible to choose a basis of TJ, so that fas. = ff. is truly 
antisymmetric (there is no distinction between upper and lower indices). All internal symmetry groups 
are compact. For example, SU(n) (the set of n x n unitary matrices): 


U =exp[iA°T,], a= 1,...,n?—1, (1) 
where 
Tr(Ti) =0, (T,)' = Th, (2) 
that is, the generators are hermitian and traceless, and hence we can choose 
(Ta, T| = tfarcle; (3) 
where the faye are fully antisymmetric. 
V1 
Physically, for example considering UV = : with an SU(n) symmetry, we find a set of associated 
Un 


Noether charges Qa, @ = 1,...n? -1, satisfying the Lie algebra commutation relations, [2c. Q| = 


1 fis: Then the transformations on WV are generated by the Noether charges: 


O = exp [AQ], (4) 


where [e* Qa, ¥] = T,V. That is, 
Uwut =u, (5) 
where U is given in (1). This is checked explicitly for SU(2) in the problem set. 
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1.2; THE GAUGE PRINCIPLE (QUANTUM ELECTRODYNAMICS REVISITED) 


Referring back to the U(1) invariant Lagrangian we studied in lecture 1: 
L = —i("d,, — m)p, (6) 


which is symmetric under w(t, #) —> e’*¢(t, Z), we note that for the Lagrangian to be symmetric, it is necessary 
that a is not position-dependent. That is, all spacetime points transform in the same way. The transformation is no 
longer a symmetry for general a = a(x), that is, if we allow different phase rotations at different spacetime points. 
The mass term is invariant under these more general transformations. The kinetic term, however, transforms as 


pV > d,(e'*OW(a)) = e',(U(z)) + 19, (a(2) )e'*™ Va), (7) 


where we have kept the xz-dependence explicit. The second term is the problem. We want to construct a theory 
(ie. a Lagrangian) which is invariant for a general a(x), that is, a theory with a local U(1) symmetry. The answer 
involves the introduction of a new vector field, and leads to quantum electrodynamics, as studied in 8.323. This 
example, in fact, embodies a deep principle: the principle of gauge invariance. As we will discuss, 


Local symmetries > Interactions, 
Local U(1)symmetry = Electromagnetic interaction, 
Local U(n)symmetries = non-Abelian gauge interactions. 


To illustrate this principle, we will now “rederive” Quantum Electrodynamics from the requirement of local U(1) 
symmetry. We would like to construct a theory which is invariant under 
w(t, z) = ie able, £), (8) 


for general a(x), also called a gauge transformation. An immediate consequence of (8) is that the ordinary 
derivative loses its physical meaning. Consider the derivative along some direction n“ : 


wb(a + en) = (a) 


€ 


rho = ln 


(9) 
If we can rotate w(a + en) and 7(x) independently, (9) does not have a definite meaning, as can be seen from the 
last term in (7). That is, it does not make sense to compare the value of ~(a) at different points. So, to write down 
a sensible theory including kinetic terms for 7, we need to introduce a new derivative, D,, such that: 
Dyth() — e°™ Dy h(a). (10) 
To do this, assume we have an object U(y, 2) that transforms under (8) as 
U(y, 2) = OU(y, ze). (11) 


U(y, x) “transports” the gauge transformation from « —> y. 


y 


xX 


Figure 2: The parallel transport U(y, x) transports the gauge transformation from z to y. 
That is, 
U(y, 2) h(a) —+ e*MU(y, ale Mel yp(x) = €'* Uy, 2)W(a)), (12) 


transforming as w(y). Since ~(y) and U(y, x)w(x) have the same transformation properties, ~(y) — U(y, x)¢(2) is 
well-defined. 
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Now take y = x + en, and define 


w(a + en) — U(x + en, x)w(x) ; 


M = jj 
nl Dh = lim : (13) 
By construction, ; : 
Dy) — lim eleleten) Dab = Dap (14) 
E> 
and a, 
L = ~i5(y"D, — my (15) 


is invariant under (8). We now want to construct U(y,x) explicitly. Since only local phase multiplication is a 
symmetry, U(y, x) should be a phase, as we don’t want to change other properties of ¢)(x). We begin infinitesimally: 


U(x +en,x) =1+ien"eA, (x) +..., (16) 


where e€ is a constant and A,,() is a real vector field. Under the transformation (8), 


U(x + en, x) —> e+ U(x + en, ze), (17) 
so that ; ; 
1+ ieen" A,,(2) — ered + ien"O,a(x%))(1+ ieen A, (a) )e~**), (18) 
and hence 1 
A, (t+ Ae) 5 noe). (19) 


Finally, we have for the covariant derivative D,): 
Dut = Op — ieAph = (Op, — ieA,,)w. (20) 


Inserting the transformation laws for A,,(x) and w(x): (19) and (8), respectively, we have that D,,~(x) transforms as 
(a). We want A,,(x) to be a dynamical field, and hence we require a kinetic term for this vector field, which should 
be invariant under (19). To construct this, we note that D,,(D,w) transforms as w, and so does (D,,D, — D,D,)¥, 
so we define 


[Du Dy] = [On — teAy,, Oy — ieA,] = —ieF yy, (21) 
FiO Ab Ms, (22) 


and we have that F,,w transforms as 7, so that F),, is invariant. 
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We begin with some comments concerning gauge-symmetric theories: 


1. 
2: 
3. 


A U(1) local symmetry leads to the field A,,(a), mediating interactions between the charge fields y;(). 


No mass term is allowed for A, or the gauge symmetry is broken. 

It is possible to construct theories using other gauge invariant terms, for example 
che? Py By, (Fav Pe’), pF yey’ e- 

All of these terms can be written as a total derivative, or are non-renormalizable. 

Some of the terminology and constructs associated with gauge theories includes: 


U(y, x) : Parallel transport, 
A,,(z) : Connection, 

F,,(a) : Curvature, 

D,,(a) : Covariant derivative. 


These objects form the mathematical framework of the fibre bundle. 


Gauge symmetry is not really a symmetry. The phase of ~ (and part of A,,) does not carry physical 


information; there is a redundancy in our variables. 


Figure 1: Dividing the path y into infinitesimal segments. 


Finally, we consider the explicit form of the finite parallel transport, U(y,x): choose a path y from 


x — y. We split the path into infinitesimal segments: 
Uy(y, 2) = U(y, tn)U (an, tn-1)...U (a1, 2), 

where, from our result from the previous lecture for the infinitesimal parallel transport, 

U (ay, 2) © exp eA, (a) —2)"], 
and hence 

Uy(y, 2) = exp ie [ Ay(a)de : 
Note that U,(y,z) is not necessarily sat Giidesentiea’s in general, Uy, (y,x) A Uy, (y, x). Let 

Ur(#,@) = U_y,(y, #)U>, (y, ) 

be the parallel transport associated with the closed loop shown in figure 2. By (4), 


i ie Aye" yA Sa 45) 
Tr 


Using Stoke’s theorem, we will see in the problem set that 


Up (a,x) = exp ie [ Fyvdenas"| , 
y 


(2) 


(3) 


(7) 


Uy(x) is the Wilson loop, and it is associated with phenomena such as the Aharonov-Bohm effect, and 


Berry’s phase. 
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Figure 2: Paths 7, and y2 from x to y, and the enclosed area ©. 


1.3: NON-ABELIAN GENERALIZATIONS: YANG-MILLS THEORY 


V1 
Now consider V = : and search for a theory invariant under 
Wn 
U(x) — W'(2) = V(2) V(x), (8) 
or, with indices restored, 
Wi(a) — Wi(x) = V;" (2) ¥;(a). (9) 


Here, V(x) isan nxn unitary matrix of unit determinant, that is, V(x) € SU(n). Let us construct the non-Abelian 
generalizations of the objects we studied in the Abelian case. 


A. Covariant derivative: 


Introduce U(y, x) € SU(n), transforming as 
U(y, x) — V(y)U(y, 2)V"(2). (10) 
Again, for y4 = x" + en“, taking the limit « —> 0, we expand U(a + en, 2) : 
U(a-+ en, 2) = 1-+igen* A,,(@) + ~.+; (11) 


where g is a constant, and A,(a) is ann x n matrix. As U(y,x) € SU(n), A,(x) is also necessarily traceless 
and hermitian; A,,(a) = Al, (x). Inserting this expansion into the transformation law (10), we obtain the gauge 
transformation law for the connection: 


Ay, (2) — V(a)Ay(x)V" (a) — “(nV (a))V" (x). (12) 

As before, we define the covariant derivative by 
nl" DW = lim : [W(a + en) —U(a +en,x)V(a)]. (13) 

Hence, we have, with indices written explicitly, 
(DiW)i = O.Vi — ig(Ay)j Vj. (14) 


From (8) and (12), we have that, under a gauge transformation, D,,V(«) —> V(x)(D,V)(«), and so, the Lagrangian 
L = iV (yD, — m)¥ is invariant. 


B. Kinetic term for A,, : 


We note that under a gauge tranformation, 
[Du, Dv] ¥ — V (Dz, Di] Y, (15) 
and that 
[D,,, Dv] =[Ou — igAp, Ov — ig Av] 
=— ig(0,Ay, — 0, A, — ig [Ay, Av]) 
=— igh yp, (16) 
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with F,, an n x n matrix satisfying Fy, = Fi, and TrF),, = 0. Under a gauge transformation, from (8), we have 
that 
FU — FV =VFY, (17) 


and, as W’ = VW, we have that 
Fi, («) = V(«)Fyr(#)V" (2), (18) 


and so F,,, is gauge covariant, as can be checked directly from (12). Hence, Tr(F,,,F"”) is invariant under gauge 
transformations. 


C. The Lagrangian: 


We can now write down an invariant 7: 


L= FT (FF) ~ iW (7D, — m)¥, (19) 


V1 
withY=]| : » Dy = Oy — igAn, Fuv = Op Av — O,Ay — ig [Ay, Ar], and Al, = A,, TrA, = 0. Explicitly, this 
Lagrangian nce eee under the local gauge transformation: 
U(x) — W'(x) = V(a) V(x), 
A, (x) — Aj, (2) = V(x) Ay(x)V" (a) — = (O,V (2))V1 (x), 


F,,, (2) — F(z) = V (2) Fur (x)V" (2), 


where V(a) = exp [iA°(x)T,], and T, are the generators of the Lie algebra, satisfying [Ta,Tb] = tfaneTe. 


Remarks: 
Li A,, is massless; introducing a mass term breaks gauge invariance. 
2. It is convenient to expand A, as A, = AjTo, with a = 1,2,--- n? —1. Here, A, is an n x n matrix, and 
Al, are n* — 1 ordinary functions of x. Similarly, Fu, = F yvta- We have that 
F¢,Ta = Op AGT, — 0) AGT, — ig [AP Ts, ASTe] , (20) 
and so, explicitly, 
F°, = 0,A% — 0,A% + gfarcA® Af. (21) 
3. There is another gauge invariant term which can be constructed out of F' at the quadratic level: 
Cyr plt( he’ FP), (22) 
However, this term breaks CP-symmetry, and is also a total derivative. Nevertheless, this term is 
important at a non-perturbative level, as we will see in 8.325. 
4. Non-Abelian gauge fields are associated with fibre bundles. 


1.3.2: The Wilson loop 


U,,(y, 2) = lim Uta Syste ie Ure) (23) 


= slim, | L(t + igA,(2;)Azt), 
j= 
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with Art = cl | — a, ao = £, In41 = y. Note that the ordering is important in (23), as A, is a matrix, and so 
[A,,(z;), Av(xx)] #0 in general. 


1 


n n j- 
Uy(y,e) =1+ig SA, (x;)Aah + (ig)? Ay (aj)Act Ay (ap) Avg +...+4. (24) 
j=0 j=0 k=0 
Now, we introduce «“(s) to parameterise ¥: 
xz" (0) = 0, 2“(1) = yy”, s € [0,1]. (25) 
Then 
div" ie div" d 
; x . x co? 
Ux(y; 2) =1+ ig f dsAy(a(s1)) + (ig)? [as [ e2Ap(o(s)) Fe Avle(sa)) tet (26) 
ds 1 ds1 ds 
0 0 0 
so 1 Si Sn-1 d ii d 
1 Mn 
=Yotigy" [dss fase... / d8n Ay, (e(81)) —— ... Ay, (2(8n)) —— (27) 
= ds, ds», 
a 0 0 0 
1 
: dx 
=Pexp ig fds Au(a(s) (28) 
8 
0 
=P exp ig f Au(a)de" : (29) 
y 
By construction, under a gauge transformation, 
U(y,a) —> V(y)U>(y,0)V" (a). (30) 


To prove this directly using (12) is slightly non-trivial. As in the Abelian case, in general U,, (y, x) # U,,(y, x). For 
a closed loop [, Ur(a, x) is nontrivial. The non-Abelian generalisation of Stokes’ theorem can be used to relate the 
parallel transport around the loop to the flux passing through the loop. For an infinitesimal loop, 


1 
Up(x,x)U —- VU = gio, (31) 
where a” is the area element encircled by the loop. Under a gauge transformation, 
Up(x, 2) —> V(x)Up(a, z)V" (2), (32) 


and hence, 
Wr(ax) = Tr(Up(a, x)) = T(Pexpig f Adal) (33) 
r 


is gauge invariant. This is the non-Abelian Wilson loop. It is a very important object. 
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1.3.3: Field equations and conservation laws 


We begin with the Lagrangian we considered in the last lecture: 


Gx FTF FH) — iV (>"D,, — mV 
c a Vv a “Ty . ama 
— —qihi Fe °TeT”) — iY (y" (On — ig AZT) — m)W. (1) 


For the gauge group SU(n), we can choose T@ such that Tr(T?T”) = 645, and so c = 1. We now obtain the equations 
of motion. 


A. Varying A,, : 


OF ii, = O06 Ay — OLOAT, + Of be(5AP, AS + A? 5AS) = (D,,6A,)* — (DydA,,)* (2) 
where (D,,dA,)* = 0,0A% + gf ¢-A® OAS. Now, 
if = — 56g, Fe — gl y"T?VS A, (3) 
and so 
(Dum )e = Ie, (4) 
where (D, F#")* = 0,.FRYe + gf%_ AX Ree, and J’* = gWy’T*W. Equation (4) can also be written as: 
o,F Hv = 5” (5) 
with j”* = gby’T*U — Gy esa, eee, 
B. Varying WV: 
(7?D,— m)¥ =0. (6) 


We note for emphasis that this is a matrix equation. Now, we recall that in quantum electrodynamics, 0,,.F"” = j”, 
with j” = eWy’W and 0,j” = 0, that is, 7” is a conserved current. When j = 0, A, is a free field, and we obtain 
free electromagnetic wave solutions. 


Remarks: 

Le In the non-Abelian case, the theory for A, remains interacting with V = 0. In quantum electrodynamics, 
A, is neutral, whereas, in the non-Abelian case, Aj, carries the group index, and so is charged under 
itself, leading to self-interaction. 

2, In terms of Fy, = Ft,T? and J” = J’°T, we have from (4) that 


DF! = J”, (7) 
with D, FY" = 0,F"" — ig [A,, FY”], and under a gauge transformation, 
Fy —VEwV', 
D,,FHY —+ VD, FHV". (8) 


more generally, for any X = XT", which transforms as X —> VXV1t, D,X = 0,X — ig[Ay,X] 
transforms as D,,X —> V(D,,X)V'. From (7), we have that 


JY 3 VI'vVi. (9) 
This will be checked directly in the problem sets. 
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3. Acting with D, on (7), 


1 
Dy DFM” =5 [Dy, Du] FM” 


1 
“2 [Pops Fev] =0, 


and so D, J” = 0. This can also be checked directly from the equations of motion. 


4. J” is precisely the conserved Noether current for global SU(n) symmetry in the absence of A,,. For 
A, #0, J” is covariantly conserved. Of course, (1) is also invariant under global transformations 


U(r) — VU(z), A, (x) — VA, (2)V", (10) 


with V a position-independent SU(n) matrix. The Noether current for this global symmetry is precisely 
j”, which was introduced in (5). From (5), 


Opj" = 0. (11) 
Note that j” depends on A,, non-trivially, which is true if and only if Aj, is charged. j” is not gauge 


invariant: it does not have good transformation properties. 


1.3.4 Further generalizations 
A representation of a Lie group G on a vector space V is a linear action 
g:vEV (12) 


for g € G, v € V, such that 
M1 ° (g2°¥) = (91° ga) -¥ (13) 


where o is the group product. Here, V is the representation space. 


A representation of a Lie algebra g on a vector space V is a linear action 


r:veV (14) 

for g € G, v € V, such that 
[z,y]-v =a: (y-v)—y-(a-v) (15) 
where [, ] is defined using the group product. The concept of a representation is, again, tailor-made for physics. 


Here, V is the physical space, and the same abstract group can appear in different physical contexts, with different 
V (with different representations). We note that a representation for G induces a representation for g, and visa versa. 


Example 1: Angular momentum, SU(2) 


A spin-j representation is described by (27 + 1) x (27 + 1) matrices, acting on a (27 + 1)-dimensional V. For 
SU(n), representing the group as n X n unitary matrices acting on n-dimensional complex vectors gives the fun- 
damental representation. Here, 

Ay = A,T° € g, (16) 
with T® in the fundamental representation. More generally, a representation r of g (or G) of dimension d,. is defined 
by d, x d, matrices pl) that represent the generators, satisfying 


POT" | = afer. (17) 


One can prove that, for compact groups, 
Tr(TLT,”) = C(r) bar, (18) 


where C(r) is a positive number depending on the representation r. For non-compact groups, Tr(TOT) is not 
positive-definite. Amongst all representations, there is a special one for all G (and g). The adjoint representation, 
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which is universal for all Lie groups and Lie algebras, has as its representation space the vector space for the Lie 
algebra itself. That is, V = g (dim V = dimg). The action of the Lie algebra is defined, for « € V = g, y € g, by 


y-v=ly,a]. (19) 


We need to show that this rule satisfies (15), that is, 


yi: (y2°%) — yo: (yr ©) = [y1, ye] - 2. (20) 


The left-hand side of this equation is [y1, [y2, z]] — [y2, [y1, z]], and the right-hand side is [[y1, ya] ,z]. The equality of 
these follows from the Jacobi identity, which is an automatic consequence of the associativity of the group product, 
so (19) indeed gives a representation. For SU(n), the TL) are (n? —1) x (n?—1) matrices. Now we go back to the 
theory, and generalize as follows: we consider a general compact group G in place of SU(n), with W in the vector 


space of some representation r of G. A, = A® a € g. Now we take 
id LU 9 


L= FT (FF) — i (4"D,, — m)W, (21) 


with F,, = F%, ir) Dy = On — igAaTs”. Since Tr(TOT) = C(r)dap, to maintain canonical normalization, 
c= Oy" Here, compactness of G is essential, as non-compactness leads to the wrong sign for kinetic terms of A‘. 
Now, the action of the Lie group G on V = g is given, for g € G, x € g, by 


g:t=gonrog. (22) 


It is easy to check that this satisfies the group action product rule (13). For infinitesimal g = 1+ ty, y € g, this 
action reduces to (19). In matrix form, for x € V = g, x = x°T; (the upper and lower indices can be distinct for 
this general treatment) 


Tee) gf a) = (Pit le aie Ta (23) 
On the other hand, 
TEA) or = (TiO) 2) °T. (24) 
where (Te zye = (TAO Ve a, Hence, 
5 Sti (25) 
Remarks: 
1. Consider M® in the adjoint representation, a = 1,...,dimg; 
(D,.M)* =0,.M* — igAa(Te® - M)* 
=0,M* + gf§.A®M®, 
where the second line follows from (25). In matrix form, with M = M@ ap”) for any representation r, 
DuM =0,M — igAiTe®) . M 
=0,M — igA@T) 
=0,M = ig [Ay M) 
with A, = A®T,. 
2. Under a gauge transformation, 


M—-VMV"t," 
M°T, —-M*VT,V—", 
and hence, 
M* — M?D?, (26) 
where VT,V~! = DOT; 
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1.4; QUANTIZATION OF NON-ABELIAN GAUGE THEORIES 
1.4.1: Gauge Symmetries 


Gauge symmetry is not a true symmetry, but a reflection of the fact that a theory possesses redundant degrees 
of freedom. A gauge symmetry implies the existence of different field configurations which are equivalent. For 
example, in the case of U(1), 


w e()ap 
1 
Ay — Ag+ 2 Onole), 


the phase of 7 is not a physical degree of freedom. Similarly, nor is the longitudinal part of A,,. A massless spin-1 
representation of the Lorentz group has only two polarizations. A, has four components. Thus to have a Lorentz 
covariant formulation, we require gauge symmetries to get rid of the extra degrees of freedom. 


a physically inequivalent 
configurations 


gauge orbits 


Figure 1: Equivalent gauge orbits in configuration space. 


When quantizing the theory, we should separate the redundant and physical degrees of freedom. We need to make 
sure only physical modes contribute to observables. This leads to complications in dealing with gauge theories. 
There are two general approaches: 


1: Isolate the physical degrees of freedom: fix a gauge and quantize the resulting constrained system. This 
method is used, for example, in axial gauge quantization in quantum electrodynamics. 


2. Retain the unphysical modes, or even introduce additional modes, but make sure that they do not 
contribute to any physical observables. This method is used, for example, in covariant path integral 
quantization. 


For the first complication in the path integral quantization, consider, for example, the path integral for a scalar 
field 
[26 ew Saino KO+V(9)-I7b — eV ay eg[UaIT KT (1) 


where K is the kinetic operator (—0? +m?) and K~! is the propagator for ¢. For gauge theories, the inverse of K 
is not defined. For example, in quantum electrodynamics, 


Fy FY = (0, Ap — 0p A,)(O"A” — 0” A*) 
=A,K' A, + total derivatives, 
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with KY = 0?” — 04d". We see that K“”O,a(x) = 0 for any a(x), and so the matrix is singular. These zero 
eigenmodes are the configuarations which are gauge-equivalent to 0. Non-Abelian gauge theories have the same 
quadratic kinetic terms as quantum electrodynamics. In order for K to have an inverse, we need to separate gauge 
orbits with physically inequivalent configurations. 


1.4.2 Fadeev-Popov method: 


Example 1: A trivial example 


physically inequivalent 
\, configurations 


gauge orbits 


Figure 2: The radial direction gives inequivalent configurations, and the circles of fixed radius are the gauge orbits. 


Consider 
W= [ exdy ef (2) 


and suppose f(z, y) only depends on r = ,/x? + y?. Then 
W= / drdo re!) 
= 2m f drrell”, 


where the 27 is the factorized orbit volume. Equivalently, we can insert a delta function. More explicitly, we can 
insert a factor of { ddod(¢ — ¢9) = 1. Then 


w= f do f dvdyel 5(6 — 60). (3) 


The f do integrates over the gauge orbit, and the other factor integrates over a section of non-gauge equivalent 
configurations. 


Example 2: Gauge theories 


We consider pure-gauge theories only; adding matter fields is trivial. 


dim G 


Z= / [] D4e() e's. (4) 


We define a set of gauge-fixing conditions: 


in order to select a section of non-equivalent configurations: 


= 6 fa(Aa(2)) 
1= fT] a(epsctataa)) aor EEO” |. 


Here, the determinant is the determinant of both the color and function space. Inserting (6) into (4), using an 
abridged notation, 
of = 


a= fan f aAe"5(f(Ag)) det | 5A 


(6) 


(7) 
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Now we observe that DA = DAj, as gauge transformations correspond to unitary transformations plus shifts, and 
that S [A] = S [Aq], because of the defining gauge symmetry. Hence, 


Z= [dh { DAne'S495(F(Aa)) det a 


= [an f aAe'$(415(4(4)) det Co ; 


as A, is a dummy integration variable. So, again we factor the partition function into the gauge volume { dA and 
a path-integral over gauge-inequivalent configurations which is independent of A. We redefine this latter factor to 
be the new partition function; that is, 


T= i D Ae’*!4)5(f(A)) det eo (8) 
Example 3: Axial gauge 

fa(A) = Az = 0. (9) 

From this, 

1 

fa(An) = AZ + gore a 9fabcALA°), (10) 

and hence, 

bfa(Aa())|  _ 1 / 

SNE Mice gael —2'). (11) 


We see that the Jacobian is independent of A,: its determinant only gives an overall constant in the partition 
function. Hence, 


Z= [paca [[ 549 (12) 


up to a constant. This is a particularly simple form. However, the drawback is that Lorentz covariance has been 
broken. In a general covariant gauge, both the determinant and delta-function factors are more difficult to work 
with. We need to employ additional tricks. 


(i) Determinant factor 


Recall that 
/ ] [bates Mort = det Mas, (13) 


where the 7, and 7, are independent Grassman variables. Hence, the Fadeev-Popov determinant is given by 


6 fa(Aa(x)) 
dAy(y) 


i [ dtad*ty Gq §fa(Aa(®)) 
ae xd°y Ca (x) 5Ap(y) aco’ oY) (14) 


’ 


det | 


2 / DCa(#)DCa (x) 


A=0 


where C,(a) and C,(x), a = 1,...,dimG, are real fermionic fields with no spinor indices. These are the ghost fields. 
(ii) Delta-function factor 


Again, the method is to write this factor in the form of an exponential. Firstly, generalize 
6(fa(A)) — 4(fa(A) — Ba(z)) (15) 


where B,(a) is an arbitrary function. This does not change the Fadeev-Popov determinant. Therefore, Z is 
independent of B,(a), and so we can weight the integrand of Z with a Gaussian distribution of B,(x). That is, 


Z= [Tl 2Batejetl treat x [PAS 5(F(A)) de “a2 7 | . (16) 
a =0 


oA 
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Collecting (14) and (16), we find for Z, 
Z= [eavc.ne. elSess14,0,0] (17) 


with the effective action Serf given by 


6 fa(Aa(x) 


Se¢¢ [A] = Sym [A] — sg f te 314) + f ted'y Cola) | S5Ap(y) 


Cr(y). (18) 


A=0 


Example 4: Lorentz gauge 


fa( Au) = OV AL. (19) 
We have that (Aj))a(x) = Aii(w) + 5 (OuAa(2) + gf*7Ai,A%), and so the Jacobian is given by 


6 fa(Aa(x)) _ 1 LL cab ac] s(4) 
Ap aos = g [0 babOn +g As] 6 (x y), (20) 
giving 
Sep =. [AL ae os (21) 
with ; 
Lyp =~ 5O"AG), Lyn = f dhe Cu(e)O"D, Cale), (22) 


where D,,Ca(x) = O.Ca(x) + 9 fara A® Ca. From this, we can derive the Feynman rules for the theory. 
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1.5: BRST SYMMETRY, PHYSICAL STATES AND UNITARITY 
1.5.1: Becchi-Rouet-Stora-Tyutin (BRST) Symmetry 


From the last lecture, we have 


Z= 5 DASDC, DO, ef 211A Ol, (1) 
with 
6 fa(Aa(x) 
dAo(y) 
1 


where f,(A) is the gauge-fixing function and So [A] = 74, F¥”* is the pure Yang-Mills action. So [A] is invariant 
under the gauge transformation 


Sef f [A,C, C] = So [A] — x | ts f2(A) +f a'ea'y Cy(x) | 


Cr), (2) 
A=0 


ARs AZ DN (3) 


where D,,A® = 0,A°% + fPPArK, We note that in (1) we integrate over all Aji(x), including the unphysical 
configurations, but by construction Z should only receive contributions from the physical A%(x). We also note 
that Z = (0,+00| 0,—00). Sepp [A,C,C] no longer has gauge symmetries, but it has a hidden global fermionic 
symmetry, the BRST symmetry, which is, in fact, a remnant of the gauge symmetry. To see this, it is convenient 
to introduce an auxillary field ha(x) : 


Z= j DAS DhagDCaDCy e491 An 0,64 (4) 
with 
Ser¢ [A,C,C,h] = So [A] + s fata he + fate ha(z) fa(2) + Lon- (5) 
Now, consider the following (BRST) transformations: 
dpAy, =7(DyC)* = ns(Az) (6) 
dpC* = — nh* = ns(C”) (7) 
1 
dpC® =— gant C'C* = 7s(C*) (8) 
dph® =0 = ns(h") (9) 
with 7 an anticommuting constant parameter. Then, in general, 
5nd =ns(b), = A,C%,0% he. (10) 
s(@¢) takes ¢ to a field of opposite ’fermionic parity’. We note some of the important properties of s : 
i. 
8(¢162) = 8(¢1)b2 = $15(¢2), (11) 
where the + sign is for ¢; bosonic, and the — sign is for ¢, fermionic. 
ii. 
s°() =0. (12) 


For example, s?(C%) = 0 and s?(C*) = 0, which follows from the Jacobi identity. 


Lecture 6 8.324 Relativistic Quantum Field Theory II Fall 2010 


iii. From (i) and (ii), we have that 


s*(F(¢)) =0. (13) 
iv. s(A/i) is the same as the infinitesimal gauge transformation of Aj, with A® replaced by C*. 


Based on the above properties, we will now prove that 685 = 0. 


We first show that 
S= So + fate s(F(a)) (14) 


with F(x) = —Cafa — §Calha, so that 
s(F(x)) = Rafa + Cas(fa(Ay)) + 3ha- (15) 


This can be established by showing that 


7 = Ofa(Aa(x 
[aieCla)s(JulAn(oy) = f atyateeaa) [PAN] | cx, (16) 
ONp (y) A=0 
which is left as an exercise to the reader. Then, we have that 
SS = 5p +n f de s*(F(o)), (17) 
and these terms are separately zero by the properties (iii) and (iv) shown above. 
The BRST symmetry implies the existence of a conserved fermionic charge Qe. 
dnd =i[nQz, 4] = ns(¢) (18) 
or, equivalently, 
5(¢) =t[QB,9]4 
_ filQz,4], — bosonic, 
7 i{Qp,¢}, @ fermionic. 
Since s*(¢) = 0, we have that 
[Qz,[Qz, d],|_ =0. (19) 
That is, [Q3. | = 0 for any ¢, and hence, 
Q?, =0. (20) 
We can also define a ghost number, which is conserved: 
gh [C] = 1, gh[C] = —1, gh[Qz] = 1, gh{d] = 0 (21) 


for any other field d. 
1.5.2: Physical States and Unitarity 


Physical states should be independent of the gauge choice. Z = (0, +00] 0,00) is so by construction, as it should 
be independent of f,(A). We now consider more general observables. More generally, we should have that 


0 = dg (fl t) = #(F] 495 |?) (22) 
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where 6, represents the change under the variation of the gauge-fixing condition f,(A). Note from (14) we have 
that 


5,8 = f d'xs(5,F(0)) 
ae / dx s(C65 fa) 
== if as {Qp,Cadfa(A)} ’ 


and so it must be true that 


[ aes {Qo,CubfalA)} Ii) =0 (23) 
for arbitrary 6 f(A) for a physical observable, and so 
Qe |t) = Qs |f) =0. (24) 
That is, a physical state |) should satisfy 
QB |p) = 0. (25) 
Similarly, by considering 
5g (f| Or... On |t) = 0, (26) 
we find that 
[Qz,0] = 0 (27) 
and so O should be gauge invariant (if it does not contain ghost fields). Note that any state of the form 
|) = Qa...) (28) 


satisfies Qp |w) = 0, but that in this case, (x| w) = 0 for any physical state |v). Such a state |W) is called a null 
state. All physical observables involving a null state vanish. If |w’), |W) satisfying (25) are related by 


lw’) =|¥) + Qa|...), (29) 


they will have the same inner product with all physical states, and thus are equivalent. We introduce 


Helosed = {|v) : QB |W) = Ot 
Hoexact = {|v) : |) = QB |. : ots 


Horosed 
KHohys = ; 
Hexact 


That is, Zpnys is the cohomology of Qs. In summary: 


1. Defining #%i, to be the Fock space composed from A,,,C, C, we have that 

Sephiaja— closed — Pehig: (30) 
2: By restricting to Zpnys and gauge invariant O, (f|O1...O, |i) does not depend on the gauge choice. 
3. Our path integral construction guarantees that only physical states contribute in the intermediate state. 


Example 1: Quantum electrodynamics in Feynman gauge (€ = 1) 


1 = 
f= FF FY 5g (A) + ,CH#"E 


19, Av) (GHA) + 8,CO#E 
9 ad Lu 
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Under the BRST transformation: 


bp A, =n0,C 
OBC = 0, A" 
dpC =0, 


and so 
(Qe, Ay aaa? 10,C 
[Qp,C] =i0, A" 
(QB, C] =0. 


It is left as an exercise for the reader to find the explicit form for Qg. Now, we set ig to be the set of states formed 
by acting with creation operators for A,,,C,C on the ground state |0). Imposing Qz |) = 0 gives us Hetosea and 
Apnys- For illustration, consider the one-particle state: 


Mig = {l€u,P) |e, Pp), |e, p)}. (31) 
Then, Qz |e,p) = Qge. A|0) = —e.p|c,p) from [Qg, A,| = —i0,,C, and we obtain the physical state condition: 
ep=0 (32) 


For e.p # 0, we get|c,p) null states. Qg|é,p) «x py A"(p) |0) A 0, and so the |é,p) are non-physical states, and 
pp A" (p) |0) = |e = p, p) is a null state. So, we have that 


Hpnys = {\e,p) : ep =0, ce ~ e4 + ph}. (33) 


Take p" = (p°,0,0,p?), p? = 0. Then, e.p = 0 implies that e“ = (p°,e1,e2,p*), and e ~ e +>: implies that 
e" = (0,e1, 2,0), and so, only transverse components of A,, generate physical states. 


Remarks: 

1. While Ap |0) creates negative-norm states, they do not lie in the physical state space (giving a positive- 
definite norm on the physical state space). 

2. Ghosts C,C make sure that these negative norm states do not contribute in intermediate steps. 
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2: GENERAL ASPECTS 


Now that we have studied almost all known quantum field theories, we return to studying physical observables in 
these theories. We will start with the simplest object: 


(0| O(a )O(y) |0) , (1) 


that is, the vacuum two-point function. It is also one of the most important objects. We will first describe its 
general structure based on symmetries, unitarity and analyticity, without the specifics of any particular theory. We 
will then consider an example in a specific theory. 


2.1: FIELD AND MASS RENORMALIZATIONS 


For illustration, consider a scalar field theory, with Lagrangian (¢) which is Lorentz invariant and translation 
invariant. We examine 


Gr(a,y) = (0| T(¢(x)b(y)) |0) , the Feynman function, 
Gi(x,y) = (0| (2) d(y) |0) , the Wightman function, and 
Gr(a,y) = O(x° — y°) (0| o(x)o(y) |0) , the retarded Green’s function. 


Here, O(x°) is the heaviside function. Translation invariance implies that Gr(r,y) = Gr(x — y), and Lorentz 
invariance implies that Gr(a — y) = Gr((a — y)*). In momentum space, we have that 


Gr(@—y) = / CP eG pp) (2) 
F\L Y= (n)t° F\P); 
and by Lorentz invariance, Gp(p) depends only on p?. Recall, in the free theory, 
$=-5 pon p — snag? (3) 


and G (p; me) = 27O(p°)d(p? + m2). The delta function constrains the support of the Green’s function to the 
mass-shell, and the heaviside function constrains the support to the positive-energy sheet. 


ey ae im 
Gr(pim’) = ae (4) 
Now, for a general interacting theory, 
Gy (x,y) = D> (0| P(x) |r) (n| Oy) |0) , (5) 


where we have just inserted a complete set of states, and where the summation heuristically also represents an 
integration over momentum. Now, 


(0| (x) |r) = (0| ew"? *b(0)e"”* |n) 
e'Pn® (0 9(0) |n) , 


and so, 


= SF? |(0| (0) |n)I?, (6) 
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or, more explicitly, 


Cxe—u)= f Gaye PAR) 6 Pa) (01 (0) MP (7 


We define (27)@(p°)p(—p?) = (27)4 >, 5 (p — pn) |(0| 6(0) ae a Lorentz invariant scalar function of p, with 
p(—p”) = 0 for p? > 0, as all Jn) should have E,, = p® > 0, and so p? <0. We thus find that 


Gs.(p) =27®(p") p(—p”) 
=| dy? 276(p? + w7)O(p°) p(n"), 


and so, for an interacting theory, we obtain the general result: 


C10) = dy? p(u2)G (p; p2) 


0) 


Sy 


Golo = fay? ute e— ys?) 


where p(2) is the spectral function. Now we consider the structure of p(—p*). We have that (27)O(p°)p(—p?) = 
(2x)*>.,, 5) (p — pn) |(0| o(0) In) |", where )7,, is a sum over all physical states: that is, a sum over single-particle 
states and multi-particle states. Firstly, 


single- p>, states 


where i indexes the particle species, and wy = \/m? + k2. Other than the m?, this sum is completely determined 


by Lorentz symmetry. Now, we consider the multi-particle states. For a given k, a continuum of w are allowed. For 
example, for a two-particle state at k = 0, 


1 
3 Qwy 


Fa ae (k? +m’), 


Y 


SS 


m~ 


Figure 1: A two particle-state with zero total momentum. 


w= [BB + m2 + [kB + m3, (8) 


w forms a continuum starting atm, + m2, and so, k? starts at —(m1 + m2)”. We thus find 
(27) O(p° =D | oy ag 27 5(k? + m7?) O(K°)(2m)45M (p — k) |(0| 6(0) |k, 4)? +27O(p)o(—p?) (9) 


where the second term is the contribution from multi-particle states. Hence, we have that 


p(—p?) = o(—p”) + 9 5(p? +m?) Z; (10) 
with Z; = |(0| ¢(0) |k, 2)|? a number which is independent of k, since k? = —m?. o(—p?) is non-zero for —p? > 4m?, 
where m is the smallest single-particle mass. So, for the Feynman function, we have 

—iZ; ae —i 
G = ee dy? —_———— 11 
si aerate pew pea Oy) 


It is convenient to introduce a function defined for a general complex parameter s, 


I(s) = D 1, a 7: dp? o(:?) = 5: (12) 
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Then Gp(p?) = I'(s = —p? + ie), and so, we obtain the Feynman function by approaching the real s—axis from 
above. We observe two features of '(s) : 


1. T(s) has poles at single-particle mass-squared values: s = mi. 


2, There is a branch cut beginning at 4m? with a discontinuity I(r + ie) —I'(r — ie) = 2ra(r). 
g g 1 


m m 4m; Re(s) 


Figure 2: The function I'(s) plotted on the complex plane. 


Now, consider 


1 1 
L = ~50,60"6 — mio? + Lr(4,), (13) 
and suppose that as A —> 0, # —> 0. At \ = 0, ey (p) = ae: If ¢ does not have any bound states, then, 
0 
for A 4 0, we have 
Ge(p) = a — + [ty ow?) (14) 
ptm —ie Jam? pe? + pe —ie’ 


In general, we note the following points: 


il m? # mé : we have a mass renormalization. m2, the bare mass which appears in the Lagrangian, is not 
necessarily m?, the physical mass which appears in the propagator. 


2. Z #1: we have a field-strength renormalization. Z = |(0| ¢(0) |k)|”. These first two points are generic 
to interactions, and have nothing to do with ultraviolet divergences. 

3. ¢(0) |0) also generates multiple-particle states. 

4. We define ¢pnys = WZ, meaning |(0| phys (0) |k)|? = 1. 
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In the last lecture, we showed, for a general interacting theory: 


-1Z < a 
p2+m—ie Jame bok are r 


Gr(p) = (1) 


where the first term is the contribution from single-particle states, and the second term is the contribution from 
multi-particle states. From this, we have that 


Im(iGp(p)) = S— 15(p? + m7)Z + wo (—p”). (2) 


a 


This is the spectral function, picking out the physical on-shell states. There is one more sum rule we wish to observe. 
Begin with the canonical quantization: 


[2¢,2), 06,9] = —i0(@ - 9. (3) 
As this operator is just a complex number, we can equate it with its expectation value: 


(0| [4(¢, #), (4, #)) |0) = - 10 - 9) 
=; (0| H(t, 2) (tT) [Oly se — A (01 (TOE. B) [Ol 
=O64(2- ype ~ GLY ly 


=| du? p(t") [ ac? (w= y; wy) — AGO (y — a; p?) 
0 


tot 


| 


=2 | dpi? p(u?) AGP (wx — y; nu) 
0 


tt 


Recall, in the free theory, we have that 


a 
AGM(e-y)] =-56(@-9), (4) 
t/t 
and so, we have that 
1= / du? p(t"). (5) 
0 


Because p(u?) = o(u?) + >, Z;6(u? — m?), where both terms are greater than or equal to zero for all values of 117, 


we have that ws 


du? o(u?) <1, Z% <1. (6) 


4m? 


In reality, the above argument may not always hold due to possible ultraviolet divergences. The same discussion 
can be generalized to a spinor, as seen in the problem set, and to a vector, as we will discuss later. 


2.2; AN EXPLICIT EXAMPLE 


Consider a scalar Lagrangian with a ¢° interaction: 


1 1 1 
are iw) HA pd 2 = 3 
L= 5 po" mob + 59? : (7) 
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This leads to the Feynman rules: 


p - p? +m? — ie’ 
= 19, 
and by dimensional analysis, we find 

d—2 
[?] [5 

[z] =e 1, 
d 
ae 
[9] 5 


where we work in a general space-time dimension d. We note that g is dimensionless in d = 6. 


ihe — 


Note that we do not consider diagrams involving tadpoles: 


(101) =a=—C_) +] )+- 


(10) 


as we can always shift the field definition to ¢ = do + 4, so that (0| db |0) = 0, and the sum of the tadpole sub- 
diagrams gives zero. Now, we define 1PI, or one-particle irreducible diagrams, as the diagrams which cannot be 


separated into two disconnected parts by cutting one propagator. We denote the sum of 1PI diagrams by 


Pp 
and so, we have that 


(11) 


(12) 
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That is, 


Gr(p) =GP + GO ie? + eM meO me? +... 

go} 
* 1-aG® 

1 

ae 

(ci) -a 

7 ~i 

— p? +m — II(p) — te 


where II(p) is the self-energy. We note that II is, in fact, a function of p? only, by Lorentz symmetry. Before 
evaluating II(p”) explicitly to lowest order, we make two remarks: 


1. The physical mass, the pole of Gr(p), is given by p? + m2 — II(p”) = 0. That is, the physical mass m? 
satisfies 
m? — m2 +II(—m?) =0 (13) 
2 The field renormalization Z, the residue of the pole, is given by expanding around the pole to lowest 
order: 
Ir (P)|p2mm2 = 
SEP) p22 —m? ~ pp? + m2 — IV (—m?)(p2 + m2) — ie 
td 1 
— p? +m? — ie atl , 
7 dp” 2 =? 
-1_47_ di 
and so Z~* = a? | oan" 


We now proceed to evaluate I(p?) to the lowest order in g: 
qtp 


iIl(p?) = —)~ (14) 
Pp 


d 
ami(p®) =5 (ia)? | SAGGP GP (a+ P) 


-£ f dq 1 
2 | (m4 G2 + m3 — teat py? +m — ie) 


We now evaluate the integral explicitly, using a series of tricks. Firstly, we note the identity, due to Feynman: 


1 1 
——-=| dx1-+-d&p 6(@1 +++ +4, —1)(n—1)!(e1a1 +--+ + 2nan)~". (15) 
111+ Qn ‘ 


Thus, our integrand can be rewritten as 


= dx 
(q? + mg — ie)((q + p)? + mG — ie) i [x((q + p)? + m2) + (1 — 2) (q2 + m2)? 


= fw _, 
a [eg ap) FD] 


with D = m3+a(1—2)p?. Next, we perform a Wick rotation. In Lorentzian signature, the integral is not convenient 
to evaluate, because of the poles near the integration paths. Thus, we rotate the gq contour to the imaginary axis 
along the direction shown in figure 1. So, we let go = iga, and therefore q? = q?+...+q3 = gp, and f d4q =i [ d@qp. 
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Im(q) 


Ka as Re(q) 


Figure 1: Illustration of the Wick rotation of the variable qo. 


Combining our two results, we have the self-energy in the form 


p’) _ ig? Ef a > f oe qE Tess (16) 


We now proceed to the final evaluation. We observe that (16) is convergent only for d < 4. We may evaluate it for 
d <4, then analytically continue its value for d > 4, treating d as a complex variable. We make use of the general 


formula 
il d"qz (an)* - T(b—a—$)l(at §) 
(27)4 (qe + D)P (4m) 2T(b)T(4) 


In this case, we have b = 2, a= 0. And so, 


area). (17) 


2T(2- 4) f 
ne?) =F fae : (18) 
2 (40)? Jo (mj +a(1—2)p?)?-? 
Example 1: d=3 
2 1 
1 
I(p?) = 2 [a = 19 
P= Ten Jy Gago — ap oy 


First, we consider the physical mass to O(g?). We wish to find the solution to (13). Note that the self-energy is of 
order g’, and so the solution is given to O(g?) by 


A de 1 
167 mo Jo (1—a(1—<2x))2 


g 
— —— log3. 
167m 083 


The second term is the mass renormalization due to the interaction, to lowest order. Now, we consider the field 


renormalization, Z~! = 1 — a | sh and note that again, as II(p?) is of order g?, to lowest order we have 
=—-—m 


pre Tog. ee. 
tte ea) : (1—2(1-2))? 
1 


ree 


0.23g 
1+ 32nm3 
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where the integral in dx has been evaluated explicitly. Finally, we note that for —p? > 4m, we have that mé@ + 
x(1—2)p? is smaller than zero for a range of « between 0 and 1. Therefore, II(p?) becomes complex. It is convenient 
to consider 


Ca 1 
II(s) = dx 20 
©) 167 [ (m2 — «(1 — x)s)? ey) 
as a function of a complex variable s, with II(p*) = II(s = —p? + ie). H(s) has a branch point at s = 4m@. This is 


precisely the multiple-particle cut predicted in the general formalism last lecture. Note that m? = m2 + O(g?). We 
can now understand the physical interpretation of this result: 


iTl(p?) = ~C)- (21) 


to lowest order, and when —p? > 4m2, both the intermediate particles can simultaneously go on-shell. Im(iGr) = 
mo(—p”) becomes non-zero and Jm(—p7) is just the Feynman diagram evaluated with both the intermediate particles 
on shell, giving a factor of 6(p? + m?) for each propagator. 
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2.3: REMOVING ULTRAVIOLET DIVERGENCES 


Let us consider the same ¢? example, but now in d = 6, where the coupling constant g becomes dimensionless. 


=f fa of a dae 17 eae (1) 


with D = me + x(1 — x)p*, which is quadratically divergent. The integral over the momentum can be expressed in 


terms of gamma functions, giving 
2 d 1 
II(p”) = ga? dx es (2) 
(4r)2 Jo D?-2 

As d —> 6, this quantity diverges. A similar result holds for quantum electrodynamics. Divergences in physics are, 
actually, a good thing: they signal missing information in a model, and lead to new physics. In this case, the diver- 
gences are emerging at gg —> co. Thus, there is potentially new short-distance physics, which is to be expected, 
as in a grand unified theory or a quantum theory of gravity. We need to decide if this new physics is important for 
what we want to understand and calculate. There are two possibilities: either the new physics is required to make 
calculations, or it is possible to extract physical results in our regime of interest which are insensitive to the ul- 
traviolet phenomena. It turns out that this second possibility is true, and the procedure is known as renormalization. 


In the late 1930s, Dirac, Pauli and Jordan developed quantum electrodynamics. Divergences in the loop dia- 
grams were soon found. Sensitive new experiments, such as the anomalous magnetic moment, g 4 2, and the Lamb 
shift, meant that these loop corrections were important. A younger generation of physicists, including Feynman, 
Schwinger, Tomonoga and Dyson, developed the systematics of renormalization, and used it to calculate finite, sen- 
sible answers which gave extremely precise and accurate predictions. The understanding of renormalization didn’t 
come until the work of Wilson in the late 60s and early 70s. 


In our situation, the divergence in II(p?) means that, from m? — mz = —II(m?), the corrections to m? are in- 
finite, and Z~+ = 1— ae is also infinite, so there is an infinite field renormalization. As a clue to how we 
pie= 


resolve this, we note that we measure the physical m?, which is finite. Similarly, if we introduce Pphys = , then, 
for Pphyss 

—i 
p? +m? — ie 


Gr(P)|p2~2—m2 et ) (3) 
with Zpnys = 1, that is, |(0| dpnys Ip) | = 1. dphys is, again, something we directly construct from physical mea- 
surements. So, the bare quantities, m2, ¢ are ill-behaved, but this is fine, as they are not directly observable. The 


strategy, therefore, is to express the Lagrangian in terms of the physical mass and field: 


1 1 1 
L=- 5 (Obs) 7 5 0bB a 5 90%B 
ae L 2s oh 272 1 2 
a 5 20(99) 5 Zmm cw) 299? 


where ¢ is the physical field, m is the physical mass, and g is the physical coupling. By definition, we have that 
(0| $(0) |0) = 0, |(0| 6(0) |p)|? = 1. (4) 


op, gp, and mg are the bare quantities appearing in the original Lagrangian. They are related to the physical 
parameters by 


Zm Z, 
OB = Zo, mo = zm Jo = {9 (5) 
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where Zy, Zm and Z, are potentially divergent. We now write 


with 


where A = 


Remarks 
1. 
2: 


L=aA+LZr+ Lu (6) 
= - 2 _ 1 22 
1 
Ly =. gf? 
Lu =—+A(06)? — LBm2g? — Loge? 
. 2 a 6 ; 


25-1, B= 2,-1,0=Z,-1. 


A, B and C are potentially divergent, but that is allowed, as they are not observable. 


Since tree-level diagrams do not change m?, ¢ or g, the differences between bare and physical quantities 
come from loop corrections. In perturbation theory, A, B,C ~ O(g?) and higher. 


We have a set of modified Feynman rules 


D 7 p? +m? — ie’ 
= 19, 
—*—_ =_ Ap’ — Bm’, 
Pp 
= 17. 
(7) 
with the counter-terms treated as additional vertices. 
A, B, and C will be determined order by order in perturbation theory as follows: again, 
: 2 1 
iGr(p") (8) 


~ ppm? — Tp) — ie’ 


but now, to O(g?) 


and we have a contribution from the counter-term vertex. We now require 


II(p? = —m?) = 0: the physical mass condition, and 


dil. 5 


ap =—m?) =0: the physical field condition. 
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giving two constraints which can be solved to find A and B. A similar condition for the interaction 
vertex: 


ig= (10) 


can be used to determine C’, but we will not discuss that here. 


Let us now determine A and B to O(g?). To deal with divergent quantities, we first need to regularize them, for 
example by introducing a momentum cutoff : 


=f fa of = ve Ap? — Bm? (11) 


taking the limit A —+ co at the end. Now, D = m? + x2(1—x)p?; that is, mo has been replaced with m. It is often 
more convenient to use a method known as dimensional regularization. For general d, 


2T(2—4) fi 1 
g M2 3) = ap =ipet (12) 
2 (4)? Jo D?-2 


and for d = 6, [(2 — $) =I'(—1) is divergent. We will instead take d = 6 — €, taking the limit « —> 0 at the end. 
Note that [g] = § in d = 6 —« dimensions. It is convenient to write g —+ gu2, where g is now dimensionless and pz 


is some arbitrary mass scale. Hence, 


2u€T 
etal 2 Df arDt-* - Ap? — Bm?. (13) 


II(p?) = 9 =e 


The steps are now to expand in ¢, absorbing the divergent parts into A and B, and then to use the physical mass 
2 

and field conditions to determine the finite parts of A and B. Introducing a = an and using the expansion 

T(1+ §) =—2+ (7-1) + O(e), where ¥ is the Euler number, we have 


ni?) = S(-2) [ deD+<= a =f) yf dx D — [a Di0e aa Ap? — Bm? + O(€). (14) 


We note that ie dx D = m?+ ae So, the divergence can indeed by cancelled by A and B. Introducing Dp = D(p? = 
—m’*) = m?(1—2(1—2)), we have 


I(p?) = al de Diog( =) 4 > ( *+(y-1)4 loe(*)) (m? +?) — Ap? Bm?+0(0), (15) 


and so, cancelling the divergent parts into A and B, we have a finite remainder 


=3f dx Dlog(5 a) +" + bm?. (16) 


We now impose the physical mass and physical field conditions, giving 


I(p? = —m?) =0s>a=b 


Tl 1 
a — (p? =-m)=050+5 [ dxa(1—2) =0. 
And so, we find a = b = —&, and our final result is 


ip? 


=e). dx Dlog(5 == os +m?) (17) 


which is completely finite and well-defined. 


Remarks: 
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1. For A and B, we have 
2 D p a 
Ap? + Bm? = 2 (—-24(y-1)41 24 PY SO 2 4 mm? 18 
p+ Bm? = 5 (2+ (7-1) + low ga ")) (m? +E) + Ep? + m2), (18) 
which is completely determined, and divergent as « —+ 0. Note that the expression is O(a) where 
2 
a= das £ ae 
2, While A and B depend on the mass scale 4, which was introduced arbitrarily, physical quantities like 
II(p) do not depend on up. 
3. For large p?, D ~ p*a(1 — x), and so 


2 
Pp 
[M(p*) | 52500 =a |cyp? + com? + c3p” log( a) : (19) 


If a is large, alog( 2) can be large and perturbation theory can become invalidated. Later, we will 


discuss this in the context of the renormalization group. 
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2.4; UNSTABLE PARTICLES AND RESONANCES 


We recall from previous lectures that the Feynman propagator for a general interacting theory is given by 


Gr(p*) =I(s = —p* + ie), (1) 
where 
(3) = at [ atow’) 2) 
8 ms 4m? 5 —_ pe’ 
and the spectral density function is given by 
pl”) =—Im(iGr(p Vers’) 


For py? > 4mz, we have that p(u*) = o(u*) = x:Disc(T(s)|,_,,2). The reason for this is that we cannot have 


Im(s) pr) 


m a 4m Re(s) m m, 4m; H 


Figure 1: Plots of the function I'(s) on the complex plane and the spectral density function p(?). 


ms > 4m? for any particle 7 which interacts with the others. We can imagine changing the parameters of the 
theory, so that the pole moves beyond the branch point and sits on the branch cut. We will consider what happens 
in this case. Consider the Lagrangian 


1 1 1 1 1 


and examine the Feynman propagator G2 (p?). Suppose that 2m, > my > m,. Then, perturbatively, 


1 


iG. (p*) = ; 
1 P(p) p? + m3 — II) (p?) — ie 


The three-point vertex rule is given by 


o 
--- = i, (5) 
x 
b 
and hence, to lowest order, 
nO ~)=— TL, (6) 
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and we see that II‘%)(p?) becomes imaginary if —p? > 4m. Naively, the pole just sits on the branch cut. But, as 
we will show, the pole, in fact, moves to the other sheet of the complex plane. Physically, the particle becomes 
unstable since it can decay ¢ —> 2x, where the imaginary part of the pole is proportional to the decay rate. In the 
Hilbert space of physical states, no states correspond to a single ¢-particle. It becomes a resonance. 


Im(s) Im(s) 


— 


2 Re(s) 


rc) % ua 2 (Second sheet unstable resonance) 
0 


pr) 


SS 


Figure 2: Formation of an unstable resonance for mg > 2m,. 


From the last lecture, after subtracting divergences using counter-terms, 


\ 
TI) (p?) = = — 
Psa" 
_ @ : 12 12 
=e dx Dlog D — a'p* — b'ms, (7) 
0 


where a = ao D =m + x(1—<2x)p’, and a’ and 0! are finite real constants, determined by the physical mass 
and physical field conditions: 


dll 
ape? = —m?*) =0. 


But, for mz > 4m, log D is complex at p? = —m%, and there is no way to satisfy these two conditions for real a’ 


and b’. Therefore, ey (p”) cannot have a pole on the real axis. Physically, ¢ cannot be a physical single-particle 
state in #. What can be done to resolve this is to modify the physical mass and physical field conditions to: 


Re (II(p? = —m”)) =0, 


Re (Bo = -m)) =0. 


which gives suitable real solutions for a’ and b’. We then have 


1 
a D a 
0) 
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with Do = D(p? = —m4) = m5, — «(1—x)m%,. Let us define mgl = ImI(p*) | 2— ne . Then, near p? = —m?,, we 
have 
1 
iG 2 _ 
eee p? +m — II(p?) — te 
_ 1 
pet mé, —imgl 
Thus, there is a pole at p? = —ms, + imgI. In terms of the complex function I'(s), there is a pole at 
$= mi, —imgI, (9) 
and so, around the ¢-mass, the spectral density function takes the form 
1 mol 
2 F @ 
= —ImM(iG = + 10 
POH asemg = = MEG e(P*)) ™ (u2 — m2)? + m2I? wo 
Now we proceed to calculate ine 
Im(TIE")| yo mg) =f de Im(Dolog 7), (4) 
, 6 0 |Do| 
where Do =m — «(1 — a)m%, — ie. Now, Do < 0 for the range 
ee 4m? Tt 4m? 
1 X<cg<i4 1 x 12 
2 2 Cee ale m3 (12) 
and in this region, log By = —in. Therefore, letting 6 = ,/1—- le we have 
\ @ 
3(1+8) 
os an [2 
Im(II(—m3)) =— a dx (m< —a(1- x)m3) 
3(1-8) 
ss ee 
and hence, 
~ Ta 
T= —m¢* 1 
72 79? (13) 
In the problem set, we will see that when mg > 2m,, we have 
“ TH 4m 3 
Dgsax = Fy moll — mee )?, (14) 
and sol = eo We now consider a couple of additional points. 
Remarks: 
1. Consider the function log z. The function increases by 27 as we wrap around the origin each time. This 


introduces a discontinuity if we try to plot the function on one sheet of the complex plane. Instead, 
we have to plot the function on multiple sheets, introducing a branch cut, for example on the first 
sheet at 3m (log(—« + te)) = im, Im (log(—a — ie)) = —i7. Similarly, with I'(s), there is a branch cut at 


t= 4m. When we calculated 
1 


G 2 = Tr? 
ae p+ m2 —imel 


(15) 


we treated it as an analytic function across the real axis. In other words, we have done the analytic 


continuation below the cut, and so, the pole lies on the second sheet. 
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L re) i 
Re(s) 


Figure 4: The branch cut in I'(s) with the second sheet on which the pole lies. 


2, One can show that, as a consequence of the optical theorem, 
2 Barn 
Im(II(p*)) = SS ie 
Ne 7 
2 
se 
= |———-—~< : (16) 


where the propagators are put on shell, replaced by 276(pj + m%)@(p!). This will be discussed in the 
problem set. 


3. Considering the propagation of an unstable particle, for t > 0, 
dw _, 1 
Gr(t, p) = (o(t, 0,p)) = ‘| eo 17 
P(A) = (6,9)600,9) = | She (17) 
with p? = —w? + p?. Hence, 
oi eR iit 
Gr(t,p) = —————= (18) 
we — imD 
= 2 
with wz = \/p* + m?. Now, for the case that l < <2, this reduces to 
Crlt,f) =5 a wat Bist 
Wi 
ase torn 
Wi 
where we have defined the lifetime r(p) = “2[-1 = Feat, which we recognise as the effect of time 


dilation, where I~! is the lifetime of the particle in its rest frame. 


y 
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2.5: S-MATRIX ELEMENTS AND LSZ REDUCTION 


Having studied in detail the general structure of two-point functions, let us now look at the general structure of 
higher-point functions. These are of course much more complicated, and the power of Lorentz and translational 
symmetries becomes much more limited. Nevertheless, there are some important statements to be made. We again 
consider a scalar field for illustration. Generalizations to spinors, vectors and multiple fields do not contain addi- 
tional conceptual insights. 


We consider 


Gr(a1,---,@n) = (0/ T(o(11) ... G(@n)) 10), (1) 
where |0) is the exact vacuum of the interacting state, and the Fourier transform is given by 
Gr(p1,---;Pn) = fan Oty eo P1-t1t.-+ PnP) Ga(ay, eae Day) (2) 
where pi = (w;,p;). Now, by translational invariance, we have that Gr(x1,...,%n) = G(0, v2 — @1,...,%n — 21), 
and so 
Gr(p1,---,Pn) « (2m)46 (py +... + Dn). (3) 
Now, if we consider any combination of these momenta, for example 
p=pitpot...+pr=—(Pryit--.t+Pa), ler<n-l, (4) 
then Gr(p1,..-,Pn) has a pole at p? = —m?, where m, is the mass of a single-particle state. 


Proof: Let us first consider r = 1, that is, consider 
Gr(D,Y1,---,Yn—1) acer (0| P(¢(x) o(y1) --- @Yn—1)) 10) - (5) 


The integration over x° = ¢ can be separated into three regions: 


je as [ate dt, 
I II III 


oo Ty TL 
See 
Ty eat —oo 
where T_ < y?,...y2_, < Ty. In region I, 


Gr(, yis---Yn—1) = (0 O(@)T(O(y1) --- 6n—1)) |0) - (6) 


We can now use the same trick as in the case of the two-point function, inserting a complete set of physical states: 
1= oh) l= > ioe 
n 


where wi = \/k2 + mi. For simplicity, let us consider a single species. Then, we have that 


— |i, #) (i, i| + multi-particle states, (7) 


Gr(e.ti--tea) = f SF gh. (olote)1E) (EiP(6n)---614n-10) 


37. > 
: | cays sag V2 FTO). n-W)10). 
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and so, for the integral over region I, we have 


I = vz if dt / dizet-P# i) = a (k IT(...)|0) 
eilw—wp)T. 
a V5 T(...)|0) 
at VF = = AT(..)10). 
Similarly, for the integral over region ITI, we find 
Ill , 7 — (0| T(...)|-p). (8) 


wwe p? +m? — te 


Region IT is a compact integral, so it does not have singular behaviour. So we conclude that, as a function of p, 


Crt) ATC), 
OTOH. 


The above argument can be generalized to any r. Consider p = py +... + pp = —(Pr4i +... + pn). Among all 


possible orderings of t1,...,¢,, we consider those with min{t;,...,t,} > max{t,41,...,tn}. In this case, we have 
Gp (a1, ---,2n) = (7) (0 T(b(21) ---4(ar))T(b(ar41) ---$(4n)) |0) + other orderings, (9) 
with + = min{t;,...,t-} — max{t,41,...,tn}. Hence, inserting a complete set of intermediate states, including 


one-particle states, we have 


Bk 1 sco 
Grlaiy--sitn) = (0) f az ay (OPO) --- lee) R) (FIP ler1) --- OE n))10), (10) 
and we proceed to take the Fourier transform Gp(a1,...,%,) —> Gpr(pi,.-.,Pn). The analysis of the Fourier 


transform is a bit more intricate than the r = 1 case. The details are left to the reader; they can be found in 
Weinberg, Volume I, 810.2. The result is that 


Gr(p1,---;Pn) ———> (2r)* 5(4) (pi +... + Dn) 


7 
Moga, <0 13 Pe) Mano (Or48y~ 94D n 1 
ars Pata o|a(P2,---» Pr) Moo (Pr+2,---+Pn) (11) 


where 


Mo\p(P2,-+-sPr) = [ev 2 Typ CPR Prete (O| T(b(0)G(Y2) --- b(Yr)) |B) 5 (12) 
and 
Myo (Pr-+25-++>Pn) = [tures BY py CT Prteurte = —iPmdn (5) T(G(0)P(Yr+2) --- $(Yn)) |0) - (13) 

Remarks: 

1. The result is generally valid for any interacting theory, and is non-perturbative in nature. In particular, 
as in the case of the two-point function, the single-particle states do not have to correspond to fields 
which appear in the Lagrangian. Further, ¢ does not have to be a fundamental field appearing in the 
Lagrangian: the same conclusion applies if one uses composite operators. For example, in quantum 
chromodynamics, pions can appear as such a pole. 

2. The result is physically intuitive. Diagramatically, it can be expressed as, when p = pi +...+ pr is close 


to on-shell for a particle of mass m, 
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Pr Pr+1 Pr+1 
: Pr+2 Pr+2 
(14) 
P2 : pone 
Pi Pn 
where the internal propagator contributes a factor of oe Similarly, 
Pr Prt+1 Pr+1 
: Pr+2 Pr+2 
; te: _ pe? : (15) 
p2 : ws 


P1 Pn 
2.5.2: LSZ Reduction 


Now, let us take p1, p2 to be simultaneously on-shell. That is, p? © —wy,, p? * —wy,. Then the Green’s function is 


given by > 
Gras avsPn) 9 oP LE (Bf Malta) Onl) PP) (16 
If we now take p® + w,,,...,p° © wy, we find 
n : 
Ge(prspas---sPn) —> TY ak ae (POs Pal “Pts —Fe) (17) 
giving the S—matrix element (p3,...,Pn| —P1, —P2). This suggests that we can calculate the S-matrix elements 


using Feynman diagrams as follows: 


1. Consider all Feynman diagrams for the Green’s function G(p1,..., Dn). 
2: Put all the external momenta on shell: 
p? + —wy, for initial momenta, (18) 
p} + wp, for final momenta. 
3. Obtain amptutated amplitudes by discarding the external propagators. 
A. Since an external propagator behaves as wie. near the mass-shell, we have 
Pr Pr+1 
Wl=TIv% “a (19) 
G21 P2 : 
Pi Pn 
where the shaded vertex denotes all amputated diagrams. 
2.6: THE OPTICAL THEOREM 
The optical theorem is a simple consequence of the unitarity of the S-matrix. Since 
Shy =a, (20) 
where it is convential to write S = 1+ 7T, we have that 
~i(T —T') =T'T. (21) 
We now take the matrix elements of this equation between some states 
~i (b|(T —T*) |a) = (6 TT |a) (22) 
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where we have (b| T']a) = M(a > b), and so (b| Tt |a) = M(b — a). Inserting a complete set of states into the 
right-hand side of this equation, we obtain 


Q|TIT ja) = Yo (| T* |n) (n| Ta) 


= S° M(a> n)M(b> n)x, 


and so we obtain the result 


i[M(a > b) —M(b> a)x] = So |M(an)/’. (23) 
In particular, if we take a = b, we find 


25m(M(a > a)) = 5° |M(a> n)|*, (24) 


relating the imaginary part of forward scattering to the total cross section. This result can also be applied order- 
by-order in perturbation theory. Recall the example we discussed in lecture 10: 


rH =< |. (25) 
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3: GENERAL ASPECTS OF QUANTUM ELECTRODYNAMICS 


3.1: RENORMALIZED LAGRANGIAN 
Consider the Lagrangian of quantum electrodynamics in terms of the bare quantities: 


1 yo st : 
L= = Gis — ide (y" (0, — ies AP) —mp)vp. (1) 


We use the convention: 
{yf = anh”, (2) 


w=-1 W=-% W=%, 
7 2 
7) = wryo, Lk} _ key, k = ag 
where, in four dimensions, (n"”) = diag(—1,1,1,1). We expect to find the mass and field renormalizations. 


Note: we will omit the “B” signifying bare quantities in what follows. 


along with vertex corrections 


ok eh Ae ie (4) 


We will look at how to introduce renormalized quantities. 
3.1.1: Fermion self-energy 


We have that a 
Saa(x) = (0| T(va(x)v%a(0) |0) , (5) 
and 


1PI=— 


>>) 
B+ ee — t+t>GQG-°* +. 


29 {Ba (BVA SoC Sh erSu =D Sal bse aaw, 
Gy ba 
9 1+ So : 


Sap(k) = « 
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where we have omitted the spinor indices in the second and third lines: these are to be read as matrix equations. 
Hence, we have that 


St Sa (7) 
Recall that 
—1 
So =- ‘ See es ee (8) 
ik +mep 
and so 
S* = —(if + mp) +, (9) 


and we have for the fully interacting two-point function 


—1 
S= ; 
ik+mp—ie—d 


(10) 
Note that © = X(#), since it can only depend on # and k?. Even though it is a function of matrix, we can treat it 
as an ordinary function. The physical mass is defined by # = im, so that 

—m+mep— X(im) = 0. (11) 
We note that, again, © will be divergent. Near the pole, we have that 
A Bes 


Sx —— 12 
ik +m —ie y) 
with Z, To1+i 4 as The relations between the bare and physical quantities are given by 
mp =m+om, we = V Zow (13) 
where dm = X(im) and w is the renormalized field. 
3.1.2: Photon self-energy 
Similarly, we have that 
Dyv (x) = (0| T(AR (2) A? (0) |0) , (14) 
and 
1PI = ill 
Duolh) = nn te OGY + » Cre» + 
= Do(k) + Do(tIl) Do + Do (tI) Do (411) Do see 
1 
= Do>——— 15 
oT TLDS eo 
Hence, 
(iD)~' = (iDo)~* — IL. (16) 
Recall that 
1 keke 
i — BY (1 
0 0 k2 = te 1) ( f) k2 
1 Vv Vv 
a ore [Pr’ + Pr], 
where we have defined the transverse projector Pp” =n" — ce , and the longitudinal projector Pf” = a . Note 


that it is not a coincidence that the propagator can be built from these two tensors, n’” and kk”: they are the 
only two two-tensors allowed by symmetry. These projectors satisfy the properties 


PRP) = PE, PIP = Pi, PRP) = 0. (17) 
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Note: T and L are just labels here, and the placing of these indices does not carry meaning. Hence, we have that 


CD are Pe + P| (18) 


We may also expand II*” as 


Tr” = PRY fir(k?) + PEY fi (k*) 
60 kHkY 
= 1 fr + Sy (fz — fr). (19) 
Therefore, 
-1 V/7.2 Vy oe 
2D)" = Pe’ (k® — fr) + PE’ g ft), (20) 
and we have for the full interacting photon two-point function, 
pa -;i| pv —_1_ 4 per» _1 21 
Sa ner ae es BE, (21) 


We observe that if fr..(k? = 0) 4 0, a mass will be generated for the photon. Because II“” comes from 1PI 
diagrams, it should not be singular at k? = 0, and so f, — fr = O(k?), as k —>+ 0. We will show that gauge 
invariance ensures that no mass is generated from the loop corrections. 
3.1.3: Ward identities 
Consider the path integral for the generating functional: 

Zuma) = | DA,Dyodel%4 (22) 


where S = Sgopp+f da J,ARt+ive +upn, where we note explicitly these couplings are in terms of bare quantities. 


1 Je 1 
Lond = —-FyyFY —iwW(y"'Dy — mj) 


7 - 3g (uAny”. (23) 


We define the generating functional for connected diagrams, W [J,., 17,7], by 


A Sastin) = ew em, (24) 
For example, 
- OW [J,,7, 7 
0] Tba(x)be(y)) (0) = i——2o— ; 
(0| T(a(w)Bay)) |0) GAD nities 
$: ow [Jus 7; 


(0| T( Ag (x) A? (y)) 10) 


, 20 tant) 
bJH(x)dT”(y) J=n=n=0 , 


Recall, for infinitesimal gauge transformations, 5A, = 0,A, Ow = iegprAY, and dy) = —ieg dw. Consider a change of 
variables in the path integral: 


Ay, — Aj; =A, + 06A,, 
oy > p=yprdy, 
yp > W=o+5y 
Then we have : 7 
i: DAL DYy'Dy' elu] — i: DA, DyDpers anv), (25) 


as this is just of a change of the dummy integration variables. Note that the measure is unchanged by this shift: 


DA) Dy Diy = DA,DYDy, (26) 
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and the action for the two sets of variables are related by 
a a4 1 : 23 ; TA 
S [Ave] =S [Ave] = ; f te 0, AHO? A + [es J, 0" + teprANY — iepAYn. (27) 


Hence, we must have 


/ da M(x) i DA,DUDpes4¥4 20%, a" Sp eh Gana U: (28) 
sities 
ah ism =@ a 
We) ~ 5 = Zaqay 
ee sw 


“onla) — ~ dna)’ 


we have that 


1 6°W 
070" + A564 TY 0, 29 
€ JH (x)dJ”(y) J=n==0 - 
that is, 
any) [ (4) = 
go Dy (a — y) + OO" (x — y) = 0, (30) 
or, written in momentum-space, ; 
— PR" Dav(h) +k, =0. (31) 
If we now write 
Duv(k) = Pj, Dr(k?) + Pi, Di (k*), (32) 


with RAPE = k,, the Ward identity reduces to 


Zh Dut?) +h, =0, a3) 

and so . 
pips 34 
Fe ara (34) 


and the longitudinal part of the two-point function is completely determined. Comparing this with (21), we find 
that f,(k?) = 0, and we thus conclude that II” is purely transverse. That is, from (19), we have that 


keRY 
me” = G ea ) fr(k?). (35) 
For II“”(k) to be non-singular at & = 0, we must have 
fir(k’) = k°IM(k?), (36) 


where II(0) is non-singular. Hence, for the two-point function in the interacting theory, we have 


pea Fi éPl (37) 
Wek? = Ge | 1—TI(k2) PY |" 
Remarks: 
il The longitudinal part of D,,, does not receive any loop corrections: it is completely determined by the 


Ward identities. The physics should not depend on this part. For example, in the Landau gauge, € = 0, 
Dyyv is purely transverse. 


Lecture 12 


8.324 Relativistic Quantum Field Theory II Fall 2010 


Since II(k?) is non-singular at k? = 0, the photon remains massless to all orders. There are exceptions 
to this: it is not true in quantum electrodynamics in 1+1 dimensions, or in theories where an additional 
Higgs field is introduced. 


The residue at the k? = 0 pole is given by Z;' = 1 — II(0), and we have that 


_ pT (38) 
— 1€ 


near k? = 0. The renormalized field is given by A? = V/Z3A,. 
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We continue our analysis of renormalization in quantum electrodynamics from last lecture. 


3.1.4: Charge Renormalization 


Consider the vertex corrections: 


Guaplki,ke) = [tnd'n eli tn—tha-¥2 (| T(A,, (0) Ya(yr) a (y2)) [0) 
ka, B 
= ; (1) 
ko — ki, 
ki, 
where, again, these are defined in terms of bare quantities. We introduce an effective vertex, [, defined by 
Gap (ki, ko) = Duv (ke — 1) Sas (ki PS) (Fr, k2) Sag (ka). (2) 
In perturbation theory, 
The (k1, ka) = + ene. 
= —tepyigt---. (3) 


We note that only 1PI diagrams contribute, by the definition. We will now show that gauge invariance, in the 
form of the Ward identities, puts important constraints on the structure of TM B Acting on the generating 
functional for connected diagrams, and setting J, = 1 = 7) = 0, we have 


WwW 
Oa(x)dng (y2) 


OW 
bIu(x)O7o(Y1)One(y2) 


5) (a — yo) 


J=n=7n=0 


1 
0? aH 
g 


2 
=iep se Yi) wal 
fSy= 


7a (y1)Ona(x) 


= 


or, equivalently, 


20°" (0|T(Ay (O)tba(u1) Gave) 10) = en [5(e ~ 41) (01 F(a) a(y2)) 10) ~ 8 (e— ye) (01 Tan )eh9(x)) [0] 
(5) 


Changing basis to momentum space, we have 04 —> ig, where q = (kg — ki)#. We can set x = 0 by applying 
J d*yid*yz et*1-¥1—*2-4¥2 on both sides, giving 


— ZF al" Dyv(a) Sas (Fr PS (ha, k2)Sxa(k2) = ep [Sap (ko) — Saa(hi)] . (6) 
In the last lecture, we showed 2070" Dy = —k,. And so, the result, when written in terms of matrices in spinor 
space, reduces to 
—S(k1)(qT")S(k2) = ep (S(k2) — S(k1)), (7) 
or, equivalently, 
qT” (ki, ke) = ep (S7*(ko) — S7*(kx)) , (8) 


where q = kg — ky. This is an important constraint. To see the implications, we consider k; = k, k on-shell and 
q — 0, meaning kz is also close to on-shell. Then 


1 
S'(ki) & zie -m—ite)+... 


1 
S—1(kg) ~ ge lat eA) ees 


r 
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where m here is the physical mass. We then have 


CB. 
wl" (k,k) = Zit (9) 
or ! 
I'(k,k) = — By (10) 
2 
when & is on shell. The physical charge we measure should be 
k 
oe 2 
k 
= —tephys'y". (11) 


where & is on-shell and we are using the physical fields. In other words, consider 
GAPPYS) (ky ka) = [and'n ern eha-42 (0) T(A,, (0) Vo(yr)b, (ye) 10) (12) 
where the fields are now the physical fields, rather than the bare fields. Then 
GPP) (ky, ka) = Du (Q)S (ki) P pnys (Ai, k2) S$ (ka) (13) 
where D,,, and S are again here for the physical fields. Since 


AB = /Z3An, D3, =ZsDw, YP =VZop, S? = 228, (14) 


we have that 
GB = V/Z3(/Z2)? GO), (15) 
where G? = D?SFTF SF and Gehys) = DST hs) S$. From this, we have that 


I’ (k,k) = /Z3 Zo B(k, k) (16) 
and so 
e= V/V Z3€p. (17) 
The dependence of e on Zz cancels precisely as a result of lp « F. That es = /Z3 only depends on Z3, the field 
strength renormalization of the photon, has important implications: the ratio is universal for all charged fields. 
Suppose that ey" = esfeetron. Then it is necessarily true that e?7?te” = ecletron despite the proton and electron 


interacting very differently and having different masses. If ae depended on Z2, for example, then we would have 


an extremely difficult time in explaining why e?7°" = e*ectron_ as their respective values of Z2 are very different. 
Finally, in terms of renormalized quantities: 


ql’ (ki, kz) =e [S~* (ka) aay S~*(k1)] . (18) 
We note additionally that for kj and kz on-shell, but ky 4 ko, 
ql” (ki, kz) = 0. (19) 


This is an example of a large class of identities. These are known as the general Ward identities. These identities 
are obtained by acting on the generating functional for connected diagrams with 


5 5 5 6 “) 6 


mee ae say 20 
Bo, (21) Bg (e2) ” BH(an) dma) n(x) Sea) on 
and then setting J, = 7 = 7 =0. The resulting expression is most transparently written diagramatically in 
momentum space: 
Pr ky dn Pr dn Pr dn 
oS en, a-k |-| ptk (21) 
ms v2 + p2 i q2 
Pi 71 Pl 71 Pl 1 
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where each external line should be considered as an exact photon or fermion propagator, except that associated 
with k, on the left-hand side, which should be amputated. 


Remarks: 

ike Suppose we attach to [’ a photon propagator at the q”. If all the external propagators are on-shell, 
the longitudinal part of the propagator does not contribute as the inner product of this with q” is zero. 
This enforces gauge invariance. 

2 Suppose we attach to I’ an external photon line, that is, ¢,[“(k,...). This is invariant under €,, —> 
€,, + ky, which is a gauge transformation A,, —> A,, + 0,A. 

3. Only charged particles need to be on-shell. Other photon lines or any other neutral particles (if they 
exist) can be off-shell, since they they do not transform under gauge transformations. 

4. For Ward identities to be valid, regularization should preserve gauge invariance. 
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We now consider the Lagrangian for quantum electrodynamics in terms of renormalized quantities. 


1 oS . 
L= — 7 FFs — ibe" (Ou — iep A?) — ma)bs 


; ; : 
=~ 7 %sFuv FY — iZo(" Oy — m — dm) — ZreAybyd. 


We know from previous lectures that there is no mass term for A, that the bare and physical fields and couplings 
are related by 


Al = VZ3An, 
bp = VZov, 


mp = m+om, 


1 
ep = —~$§e 


VZ3 


and that there is no renormalization for the gauge fixing term. These results are a consequence of gauge 
symmetry, enforced through the Ward identities. We split the Lagrangian into three pieces: 


L= H+ 07,4 La, (1) 


where we have 


1 a 
Lo = — gq Fw R™ = ip(y" On ~~ m)w, 
Li 7 —eA,wy"y, 
1 = 
La = ~7(4Z3— 1) Fu PMY — i(Zo — Io(q"d, — m)p 


—iZ,dmap — (Zp — 1)eAyby"y. 


Lp is the free Lagrangian, “% is the interaction Lagrangian, and %, is the counter-term Lagrangian. The 
parameters Z3 — 1, Z, — 1 and 6m are specified by the following renormalization conditions: 


1. For the spinor propagator, S(k) = oma? 


Ur gi ie 0 (Physical mass condition), 
dd : as 
— = 0 (Physical field condition). 
dk k=-im 


if bg 
2. For the photon propagator, Dik) — eo ihe: 


|,9 = 0 (Physical mass condition). (2) 


These three conditions allow us to fix our three parameters. We note that there is no need to introduce conditions 
on vertex corrections, and so, # is written in terms of physically measured masses and couplings. From this 
deconstruction, we acquire a set of Feynman rules for the interaction and counterterms in terms of the physical 
propagators. 
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mame = "Suv 
p k2 + ie’ 
2 1 
D ~ ifk—m + ie’ 
=. ey, 
mreKrns = Or (Z3 - 1)(k? gt” — ke’) w O(e?), 
Pp 
—— = -1(Z, — 1) (ik — m) + Z26m ~ O(e?), 
Pp 
= —i(Z,—1)ey" ~ O(e?). (3) 


3.2: VERTEX FUNCTION 


Consider the effective vertex we defined before: 


ee k) = ll 


Ss Slope (4) 


This is the physical vertex: it captures the full electromagnetic properties of a spinor interacting with a photon. 
As we showed in the previous lecture, the Ward identities impose that 


I“ (k,k) = —iey" (5) 


pan 
interaction with a static potential, measuring electric charge. We will now proceed to examine the general 


structure of [!(k1, k2), with k, and kg on-shell. We will discuss the physical interpretation, and we will compute 
the one-loop correction explicitly. For general k? = kj = —m?, q? = (ky — k,)* #0, the process being described is 
an electron interacting a general external electromagnetic field. From Lorentz invariance, we can build I’ from 
a", ki! and kf. Hence, 


when & is on-shell, with e = ep being the physical charge. We note that in this case, g = 0, and so this is an 


iI" (ky, ko) = A+ i(ke + kB + (ko — k)C, (6) 


where A, B, and C are 4 x 4 matrix functions of ky and kp. But, since k; and kz are on-shell, and I“ always 


appears in a product as 
tis (ko )P" (ky, ko )us(k1) (7) 


where us/(k1) and tis(k2) are on-shell spinor wave functions, we can then simplify I’ with the understanding that 
it will always be found in this combination, using the on-shell spinor identities 


kus(k) = —imu,(k), 
tis(k)k = —imi,(k). 
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Hence, A, B, and C are scalars, and functions of the scalars k?, k3 and k,.k2, or, equivalently, of g? and m. From 
the Ward identities, we have that 
qui” = 0, (8) 


and, as tis/(k2)gus(ki) = 0, and q,,(ki' — kf) = 0, only the term in C on the left-hand side is non-zero. We therefore 


have C = 0. It is common to rewrite I using the Gordon identity. Defining o“” = 5 [y", 7], this result states 


tig (ko) Y"us(ki) = 5nile (ka) [(hy + hE) + iquo*”] ws(h1). (9) 


This allows us to exchange the term in B for a term in a”. 


Proof 


us (ka)y"us(ki) = a [tis (ko) 7" hy Us (kK) a5 tis (ko) Roy"us(k1)] 


2 
a kay + Ky kay _ Ky 
2m 2 2 


kay + kay kay — Kiy a v 
p (Feet 4 Bee) anda atatt)| 
_ a = kay ay ky Ta kay ai iy wowv 
= tn [( tH) rary — (eSB) nwt 


a : v 
= Bp ts’ (Ka) [Cha + kt) + tquo™] us (hr). 


) tis (ko) y"y" us (k1) 


From this, we find that 


iT (hy, ke) =e |“ F(@?) — 2 FA(@?)| . (10) 


F\(q?) and F4(q?) are known as form factors. We have that eF\(q?) = A+2mB, and eFo(q?) = —2mB. Note that 
the Ward identity means that Ff) (0) = 1 exactly. 
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3.3: ANOMALOUS MAGNETIC MOMENT 


In the last lecture, we showed that the physical vertex I“(k,, ko) takes the general form 


“Ty LL 2 oN Gq, 2 
iT" (ki, ko) =e |y"Fi(g’) — 5 F(q°)| , (1) 
m 
and in the limit ky — kp = q —> 0, 
oy yO Ge =o 
ww (k1, ka) 7 ely 5) F,(0) = Depp (ki, ke). (2) 
m 
This vertex is reproduced by the effective Lagrangian, 
oe = teFo(0) - 
Lopp= =n (yd, — my — eA ye — teFA0) Fi Fi. (3) 


4m 


Consider the case where ~ is non-relativistic, in a classical electromagnetic background A,. That is, 


pi~mv?+m prmy, 


2 


A° ~ mv Aw~mu, 


where v < 1. This is consistent, because D“ = 04 — ieAX = i(p— eA)“, so, A° and A interact with 7) and give it 
energy of the order mv”, and momentum of the order mv. The Dirac equation now has the form 


(7 (Ou — teAy,) — m)a 4 PO) pot) =0, (4) 
™m 
or 10, = Hw, with 
H=mB+4.(p—eA)+eA° 4 eh) OGHY Fy. (5) 


Here, 8 = —i7° and a’ = —iy°y'. We will choose the basis 


or, equivalently, 


From this, we find 

6200 = 2 oo 0 ot 

Ae Se. oe bel =( iat 
Lehre ge aR ee 50) 
51 Yi y |] =e oy oa 


We now write ~? = ( ox ) , Fo, = E, and Fy; = €ij~B,. The Dirac equation reduces to two coupled partial 
differential equations: 


I 


yo 


> Le F: = ss 
i0.6 = mo+é.(p—eA)x+eA°ot [a8 7.B 6] ; 
m 
=, Lek E. sf 
ix = —my+a.(p—eA)o + eA°y + eh) Eo 4 id. By| 
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We now let ¢ = e7~?™®, x =e? X. As id ~ [m+ O(mv?)] , ® and X describe fluctuations with 
AE ~ mv”. In terms of these fields, taking the limit v —> 0, the equations reduce to 


as : 
id = sax tea + HBO |-i# Bo) + O(v), 
2m 
0 = —-2mX 4+ e.#6 + O(v7), 


where 7 = p— cA, and so, solving the second equation for X, and inserting the result into the first equation, we 
obtain 


1 
1 F(0 ~ 
io = L@ae+ ed + 220s zo. 
2m 2m 
Now, ae ae 
(6.2%)? = ojo;n'n? = (bij + teijnon)a'T! = 1? + eG .B, (8) 
as [as 7d | = —ieF) = —iee;;,B,, and so we arrive at the following time-evolution equation in the limit v —> 0: 
1 = 
id,® = | —(p— eA)? + eA° + (1 + F4(0))G.B| ©. (9) 
2m 2m 


We recognise the first two terms as the kinetic energy of a particle in an electromagnetic field and the electrostatic 
potential energy, respectively, and the third term takes the form of a magnetic interaction, 


Amag = —i.B, (10) 
with e 
P(E RO) 46. (11) 
L MN ! 2 9 ey: ’ 


with 5 = g the spin, and y = 5" g the gyromagnetic ratio. Classically, we expect g = 1, and in the Dirac equation 
of quantum mechanics, we find g = 2. We see that in the case of quantum electrodynamics, we have 
g =2+2F (0). The additional term of 2F2(0) is known as the anomalous magnetic moment. We will now 


explicitly compute the lowest order correction to the magnetic moment. 
3.3.1: One-loop correction to the magnetic moment 


To lowest order, the correction to the physical vertex function is given by 


kg 
It (ki, kg) = 
ky 
-\3 dl p Hb v F(0) 
— (—ie) / (Qn)? So(ke + ly So(ky + ly De (1), (12) 
where So(k) = wi and DY = spel Explicitly, we have 
di‘ ? [i(k +2) +m] y# [ih +1) +m 7” 


P# (kn, ke) = (—ée)8(-1)2(—8) / (13) 


(2m)4 ((ky +1)? + m? — ie) ((kg +1)? + m2 — ie) (12 — ie)” 


We can combine the denominators using the Feynman trick, 


1 ip [ 2 
—__ = dx dx dx30(a1 +2242 1 , if 
A, A2A3 fe ie ay ee Meat: eal + 9A + 23.A3)° ue 
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reducing our result for I‘ to 


0 


1 1 1 Ne 
iS 2e° f ary | dre f dx30(a1 + 22 + 23-1) De’ (15) 
0 0 


where 


NY =" [iho +1) +m) 7" [ih +4) +m] w, (16) 


and 


S 
| 


L1 [Ri +1)? +m] + £2 [(ho 1)? 4 m?| + x31? — ie 
(1 + ryky + rok)” + ay(1 x1)ki t xa(1 xo)ke 221 29k1.k + (11 + r2)m — 1€. 


We may shift the variable in the integral 1 —> p = 1+ 2k, + xok2, and rewrite the result in terms of q? instead of 
ky.ka, giving 
D=p?4+a,229q7 + (a1 + £2)°m? — te. (17) 


Further, 


N# ” [i(p— ei, + (1 — @2)ho) +m] [i(p t+ (1 — 21) Ry + voky) +m] wv 
= Vp pr +7” [ileiky + (1 — 2) hg) +m]! [i((1 — 21), + voy) +m] w 


+terms linear in p. 


I 


The terms linear in p evaluate to zero in the integral, as they are odd, so we can discard them. The first term can 
be evaluated using the identity y’dp¢y, = —2db¢, resulting in 


= py py = —2ppy" + App" = —2p?7" + Ap’ py. (18) 


This last term is again odd in the individual components of the momentum integral, and so reduces to ? “ght Vw 
inside the integral. So, the contribution to the integrand from the first term is 


—pyt. (19) 


We see that this term contributes to F\, and we know by the Ward identity that F\(0) is zero. We can disregard 
this term here. The second term is p—independent, convergent in the ultraviolet, and contains a contribution to 
Fy. We use the identities 


yee 4g, 


Vy = —2y%, 


l| 


and so the relevant contribution to N“ is 


N® = —2[-txeka +i(1 — 21) Ki] 7" [-taiky + i(1 — 22) ho] 
+4m [i(1 — 2a1)ki + i(1 — 2ar2)ke] — 2m? y". 


We discard the last term, which again contributes to Ff. We again use the fact that [’ appears in the 
combination u(k2)u(k1) for on-shell spinors, and that uf, = —imu, fu = —imu, for on-shell solutions. So, the 
relevant part of N” in the integrand can be written as 


N¥ = 2[—rom + i(1 — 21) K,] 7" [-vim + i(1 — x2) ho] 
+4im [(1 — 221) ki + (1 — 2a,)ke] . 
Using the identity fy“ = —y"# + 2k“, and again retaining only the parts contributing to Fo, the relevant part of 


N* reduces finally to 
N¥ = 2im(az1 + @)(1 — 21 — 2) (KY + ke). (20) 


Thus, we find 
Py = yh(..-) + (aT + 2) B, (21) 


1 1 1 A ‘ 
d 2 1-2, - 
B= 20° f i. | dx dx2dx36(x1 + £244 x3 yf ( a wu: aE 2)( za 2) ‘ (22) 


0 2n)4 (p? + a1 H2q? + (a1 + 22)?m? — ie)? 


with 
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Im(q) 


Ka ~s Re(q) 


Figure 1: Illustration of the Wick rotation of the variable qo. 


Applying the Wick rotation, p° = ip}, dp = id*pp, we can explicitly evaluate 


d‘pp 1 1 
Lae ETAT ~ HS " 


and so leet 
1 x3(1— 23) 
B=—4me? —~ dxidxod 1 24 
me 3272 | | [ XL1AXL2 £30(X1 +2%2+2%3 ae + (ql = a3)2m2’ ( ) 
and, finally, for the form-factor F2 we obtain the result 
2 2. 1 1-23 
FO) = -Sa@=0=5 f df dn 
e An 0 0 1 — @3 
7 gan opt , 2 a 
Ga fy 9999 = Bee = Oe 
where a = e ~ Ere and so 
g =2+2F2(0) =245. (25) 


g-? _ © — ().0011614.. at one-loop level. Experimentally, a. = 0.00115965218073(28) (Gabrielse 2008). 


i aE, ae 
Theoretically, the result is decomposed as 


a ay? a\3 a\4 

aa) eee (=) cae (=) sedi (=) (26) 
27 WT 1 T 

where the second coefficient, a2, consists of seven diagrams, and was calculated in 1957. The third coefficient 

consists of 72 diagrams, and was calculated in 1996. The fourth coefficient, a4, consists of 891 diagrams. 
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Firstly, we summarize the results of the vertex correction from the previous lecture: 


I (ki, ke) — 


= yA(g?) + i(k, + ke)" B(q?) 


Fol) 
= HB (g2) — Oo W#'2 1 
ey" Fi(’) Sao (1) 
where eF; = A+2mB and eF, = —2mB. We showed that F2(0) = —2B(0) = = = 0.0011614.., where 
a= es ~) a The integral for I’ is, in fact, infrared divergent. As 1 —> 0, 
(kit? tn? = B+m?4+2h1+0F 
= % i+? 30, 


and sol, ~ [ alé is divergent. This is due to soft photon interactions, and this effect is in fact cancelled by 
including the soft emissions: 


The explanation is that it is only reasonable to calculate measurable cross-sections: 


(2) sean 7 (in) esas @ erp dee PEelent) (3) 


The calculation then proceeds by imposing an infrared cut-off A on the photon momenta. The divergences in the 
A — 0 limit cancel among virtual and real soft photon emissions, and we can safely take the 1 —> 0 limit in the 
end. 


3.4: VACUUM POLARIZATION 


We will now evaluate the one-loop correction to the photon propagator, and consider the physical interpretation, 
recalling the general structure we considered in lecture 12. 


3.4.1: One-loop correction 


=i0 2 
e’(k) = WY =~ rte 
’ k+q 
= yi) | saith OM Solh-+ a)” Su(q)) ~ Zs ~ IHEP (4) 
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We note that the factor of (—1) at the front comes from the fermionic loop, that the trace is over the omitted spinor 


indices, and that So is the electron propagator. Having now enough experience with one-loop diagrams, we will 


omit the details of the calculation and only emphasise the new aspects. Using So(q) = ; 7 = i z, we first 


introduce the Feynman parameters: 


ANY 
ve A aw f (+Dy + counterterm, (5) 


with p=q+ 2k, D=x(1—2x)k? +m? — ie, and 


4ANMY = tr [y*(ik + ig + m)y” (ig + m)] 
tr [y" (ip +i(1—a)K+ m)y" (ip —ixk + m)| 


= tr [opr p] + mtr [47] + 21 — 2) te [YE 
+terms linear in p+terms with an odd number of ymatrices. 


We note that the trace of a term with an odd number of y-matrices gives zero, and that 


yy | San 
Ei ae |) AE ety sea sea ap ee 
Hence, disregarding irrelevant terms, we can write 
NH = —2pHp! + 20(1— a)kMRY + (m? +p? — 2(1 — 2) k)n! (6) 


Secondly, we evaluate the integrals, extending to a general dimension d. We note that 


d“p LY £2 _ dp ae 2 
| airrte)= 7 (nyt? f(p*), (7) 


and so 


dy Ant 1 — 2)n¥Y + Qn(1 — 2)ktky 
iT" (k ase [ia ef = mi + pl — a)n + 2a(1- 2) 
(p? + D)” 
where we have set A = m? —a(1—2)k?. We note that the first and third terms in the numerator are logarithmically 


divergent, and the second term is quadratically divergent. We now apply a Wick rotation p® + ipt, d‘p — id’ pp 
and p* —> p%. We recall that 


(8) 


oe eg ee ee (9 
(21)* (pe + D) (4m)? P(b)T(4) 
and so F : F 
2, [dpe iP dpe 1 
(1 af os ao pf eer 5: (10) 
(27)" (pp + D) (27)" (pp + D) 
Hence, the numerator of (8) can be replaced by 
(A= D)nh” + 2a(1 — a)khk”. (11) 
The transverse component, therefore, is given by 
—2x(1— 2)k? PRY, (12) 
and we can write 
7 Vy 22.2 E( oe - 2 pEV 
id'’(k) = —B8ie*k" Ph i(Z3 — 1)k° Pr 
(pe + Dy 


T(2-4 = 
= wvmuiteD dx sees) i(Z3 — 1)k? PRY. 
(4)? Jo D3 
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Thirdly, we use dimensional regularization, setting d = 4 — €, e > ep?, 


say —8ie?k? PRY (k) ft 2 Amp? Z 
it’ (k) = mf dz x(1— 2) ee a ar i(Z3 —1)k? PRY, (13) 
and so 
Te (he) = kee Cee), (14) 
with oe 
e€ 1 1 D 
I(k?)=—-— | dea(l l Z3 — 1). 1 
0) =-5 f avatt—2)2 - 5108 (a=) - (a0) (15) 
The physical field condition constrains that II(k? = 0) = 0, and so Z3 is fixed by 
e1 1 m? 
Z3=1 ] 1 
; Ome 2 °° (5) tf) 
and the final result for II(k?) is 
2 pl 2 
2 e k*a(1 — x) 
II(k*) = x | dx x(1 — x) log (1 te (17) 
Remarks: 
1. For k? < —4m?, II(k?) becomes complex. 
2. For more than one charged particle, we must add their respective contributions, and the smallest m 
contributes most. 
3. The internal propagator is given by 
aagis2, pv 
2 Dev ve 1) 
Ce q? — te 1 — II(q?) 
ier nh 
= = (+0) +...) 


We see that for q? >> m?, a large spacelike momentum, from (17) we have 


: ee @ pf 
Il(q°-) = yas | dx «(1 — x) 
@ 


a 


= =] — 
37 ore 


where a = = Then, the internal propagator goes as 


e? ; e2 1 e*(q) 
q? — ie 1—Il(q?) q—ie gq? —ie’ 


(18) 


with e?(q) = as the running coupling constant. 
— flog 


3.4.2: Physical implication 


Consider scattering of two charged particles with coupling constants e; and €2g, for example, in the process e~ + 
po—e +p: 
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to lowest order, where e is the coupling constant associated with the lightest intermediate particle. 


S(1,2—+ 1,2’) = (ie)*tiy (vy) 9"*u1 (p1) Dv (q)t2 (v2)7" U2 (2), 


where - 
—1 Pog (q) 


D,v(q) = 
pu (@) q? — te 1 — II(q?) 


Di); (20) 


and Pr, = uv — “3, ¢ = Pi — Pi = P — po. Note that tig (p))7"u2(pa)qv = tia (pS) (pl, — p2)U2(p2) = 0. We now 
want to consider corrections to the Coulomb potential. We consider the low energy, non-relativistic limit, where 
we derive most of our intuition about electromagnetism from, and where the notion of a potential makes most 
sense. The lowest two diagrams drawn above correspond to 


1’ at 
€1,€2 
e1, €2,V = — “nw, 
q it 
1 2 
a 2! 
€1€2 
e1, uy Rjenv = me COU (21) 
1 2 
In the non-relativistic limit, 
lIP|~bad<«<ld, Px@. (22) 


In this case, one-photon exchange corresponds to the Born approximation: 


€1€2 


= / Bre TYA(H, (23) 


where Vo(r’) = “4% is the Coulomb potential. The one-loop correction is given by a5 11(q"), where, from (17) 


Ar|r| 
ep ? 
H(@) = x | dx «(1 —x)log(1+—sa(1-2)), (24) 
Qn? 0 m? 
and the term provides a correction to the Coulomb potential 
~~ T1(G 
6V(r) = crea [ are MD). (25) 


From now on, we will for convenience write q = idl, r = ||r||. The angular integral is given by 


(nm) 


and so we find for the correction to the Coulomb potential 


> €1€2 a i aqr —wqr 
6V(7F) = roa rl — e~"4") T1(@’) 
€1,€2 
An? 
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Figure 1: Clockwise contour for the integral in dz in (26). 


vanishing contribution. 


mr 


4/x(1—2) 


If we now let z= qr,a= 


_ €1€2 e? 
ee oe 80 ir 
The integral in dz, I = [dz as log (1 + =), 
giving 


The semi-circular arc is taken to infinity and gives a 


> 2mr, the result reduces to 


i dee a) f 


2 


z 
a2} - 


1z 


dz — log 
z 


(26) 


(1+ 


can be computed using the complex contour shown in figure 1, 


=i 
f = -2 O —in 
co —axr 

7 2in | dy = 
1 aN 


after setting A — aX. So, our result for the correction to the Coulomb potential is given by 


2 1 co d _)__mr 
oV(rF) = ae dx «(1 -2) f oN, Ve(—#) 
“An 1 Jo 1 A 
= — Z(mr) 
Remarks: 
1. When mr > 1, we can evaluate the integral in the saddle-point approximation, using integration by 
parts, 
BV (v) = €1€2 e2 e72mr (27) 
An 1673 (mr)? 2 
2: Z(mr) increases with decreasing r. As r —> 0, I(r) —> oo. If we consider putting a short-distance cut 
off at mr =e <1, we find 
e? 1 
and thus 
V(r) = Vo+d6V(r) 
€1€2 
a 14+Z 
A (1 + Z(mr)) 
_ é1(r)é2(r) 
4nr 
where ‘ 
é;(r) =e;,(1+ Z(mr))?. (29) 
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OVE 
GDG) CD 
2 


Figure 2: Virtual electron-positron pairs form a screening effect. 


We observe that €;(r) —> oo as r —>+ 0, and é€;(1) —> e; as r —> 0, with small experimental corrections. 
Physically, we can view €;(r = €) as the bare charge, which is very large. The physical interpretation is 


that virtual electron-positron pairs screen the charge more at large distances. The screening length is 


given by rs ~ 5+. That is, there is a negative cloud of size x. At large distances, the charges scale as 


9 2m 
eerwe cm, 
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4: GENERAL RENORMALIZATION THEORY 


Let us recall some of the major results and methods of previous lectures: 


ile In perturbation theory, bare and physical quantities are related by ultraviolet-divergent expressions: 
Mohys = MB t+ dm, (1) 
where Mphys is finite, 6m is ultraviolet-divergent, and so mz is necessarily ultraviolet-divergent. 
2. We express the Lagrangian in terms of physical quantites, and separate it into 
L= Lo+ 27+ Le, (2) 


where -Z is the canonically normalized free Lagrangian for physical fields and masses, 7; contains the 
interaction, again in terms of physical parameters, and “%, contains the counterterms with ultraviolet- 
divergent coefficients. From %, we obtain the propagators of the physical fields. “% and %, give 
interaction vertices. 


3. At the one-loop level, the self-energy is given by the effective two-point vertices: the 1PI two-point 
vertex of the interaction and the counter-term two-point vertex. The counterterms absorb ultraviolet 
divergences, and the finite parts of the counterterms are determined by renormalization conditions, 
which ensure the quantities in % + %; are physical. The conditions constrain the self-energy and the 
effective vertices, and give a finite, uniquely-determined value for the counterterms. 


There are several questions we need to consider: 


1. Can counterterms remove higher order divergences in the self-energies and vertex corrections? 
2. Can they remove ultraviolet divergences in generic physical observables? 

3. How does the procedure work for a general theory? 

4. What is the physics behind the success (or failure) of renormalization of ultraviolet divergences? 


4.1; DEGREES OF DIVERGENCES 


Given a generic 1PI diagram M in quantum electrodynamics, or any theory, how can we tell whether it is ultraviolet 
divergent or not? We begin by introducing the superficial degree of divergence, D. This is defined by 


D = number of factors of internal momentum in the numerator — 


number of factors of internal momentum in the denominator 


If we let all the loop momenta go to infinity with a common factor S —> oo, then 


f APS DS 0 
M~ / dSSP-1~ 2logA D=0, (3) 
finite D <0. 
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It is the superficial degree because it gives a rough indication of the behaviour. We will mention caveats and 
how to deal with them later in the lecture. We will now find an explicit expression for D in the case of quantum 
electrodynamics. Introduce 


E. = Number of external electron lines, 
Ey, = Number of external photon lines, 
I. = Number of internal electron lines, 
I, = Number of internal photon lines, 
V = Number of vertices, 
L Number of loops. 
Then we have 
D=4L—I, — 21). (4) 
We can express J,, J, and L in terms of E., Ey and V: 
21,+E, = V, 
2I.+F, = 2vV, 
I.+i,-(V-1) = BL, 
and so L, = $(V—£E,), I. =V —- Ee and we find 
3 
D=4— By — 5 He. (5) 


We see that D only depends on the number of external legs, not on the internal structure of a diagram. In order to 
have ultraviolet divergences, we require D > 0, and so 


3 
By + 5Ee <4. (6) 


Therefore, only a finite number of external lines can yield superficially divergent integrals. So we have a finite 
number of classes of divergent diagrams. They precisely correspond to the counter terms we discussed earlier. We 
will enumerate them in the next section. Let us now generalize (4) to a general theory. First, we provide an 
alternative derivation: 


P1 


M6(p, + .--) 


I 


= Fourier transform of (o(x1)...¢%(xp,)A(y1) --- A(yz,)) 
with external legs amputated. (7) 


Hence, 
M ~ APev (8) 
where e€ is the coupling constant. We have that [MM] = D, as [e] = 0, and therefore 


f d4x1...d4y, ... (W(a1)... A(yi).-.) 


[M|—4 = : 
f 4x, (b(x1)0 (21)... f d4y1 (A(yi) AC) - -- 
= -B.[v] - EB, [A]. 
We note that [7] = 3 and [A] = 1, and hence 
3 
D=|M]=4— 58. — E,. (9) 
Now, for a general theory in d spacetime dimensions, the field content is given by gf, f = 1,2,..., where f labels 


the field type. [¢¢] = Ay and A; > 0 in all physical theories. We have interaction vertices of type i, 1 = 1,2,..., 
contributing a term of the form 


(ay TT 9". (10) 
f 


Lecture 17 8.324 Relativistic Quantum Field Theory II Fall 2010 


Here, A; is the coupling constant, with dimension 


[Ai] = 6;=d—nji— So nips. (11) 
f 


Now consider a 1PI diagram in such a theory: 


Ey = number of external lines of ¢f, 
Vi 


number of vertices of type 7. 


Then M ~ APJ], \j* and so D = [M] — 30, Vid;. Again, 
[M|—d =~ By, (12) 
f 
and so the general expression for the superficial degree of divergence is given by 
D=d— J) EAs — SVG. (13) 
f i 
Diagrams which are ultraviolet divergent satisfy 


So EAs +> Vidi <a. (14) 
f i 


We note that Ay > 0 in all physical theories. We can now divide all theories into 


1. All 6; > 0: there are only a finite number of superficially divergent diagrams. These are super- 
renormalizable theories. 


2. All 6; => 0: there are a finite number of classes of divergent diagrams. These are renormalizable 
theories. 
3. There exists at least one 6; < 0. In these theories, increasing V; means increaing D, so all amplitudes 


are divergent at high enough orders. These are non-renormalizable theories. 


These terms also apply to individual interactions for a vertex of type 2: 


1. 6; > 0: super-renormalizable, relevant interaction. 
2. 6; = 0: renormalizable, marginal interaction. 
3. 6; < 0: non-renormalizable, irrelevant interaction. 


Note that a diagram can be superficially convergent but divergent because of a divergent supdiagram. For example, 
in quantum electrodynamics, in the case BE, = 2, Ey = 2, in which we have that D = —1 


(15) 


Figure 1: The first two diagrams with E, = 2, E, = 2 both diverge because they contain divergent subdiagrams, 
whereas the third diagram is finite. 
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4.2: CANCELLATION OF DIVERGENCES 
4.2.1: General Structure 


Consider a generic divergent diagram M of degree D; that is, 


A 
u~ f ise" (1) 


if all loop momenta are taken proportional to s. Generically, internal propagators have the form 


(2) 


1 1 _ {? scalar or vector, 


(as+p)%+... ~ 50 1 fermion. 


for large s, where a is a numerical constant and p is a combination of the external momenta. Differentiating M, 
with respect to p, n times gives a term proportional to 


1 1 
~ 3 
(as+p)otm = gvtn? (3) 


and so D + 1 derivatives with respect to the external momenta will make M finite. This means that 
M(p) = Mo + Mip+...Mpp” + finite terms, (4) 


where the argument p of the function represents the collection of external momenta, we have suppressed the index 
structure, and Mo,Mj,...,Mp are potentially divergent constants. Suppose that M has Hy external lines of the 
field gf. Then divergences of M(p) can be cancelled by counterterms of the form 


> 4;(0! TI oF (5) 
; 


=0 


a 


where the A; are divergent coefficients in order to cancel divergences in M;. The index structure in 4;0! should 
match the suppressed index structure of M(p). 


4.2.2: Renormalization of Quantum Electrodynamics 


In the case of quantum electrodynamics, 


3 

D=4—-E,—5E., (6) 

and hence, ultraviolet-divergent diagrams are given by 

3 
Ey + 5B. <4. (7) 
We now enumerate these classes of diagrams: 
1. E, = E. =0, D=4: These are the vacuum diagrams, and give a divergent constant Ao. The divergence 
can be removed by including a constant A., counterterm in the Lagrangian, such that 

Aphys = No Pos Net. (8) 


Apnys does not have any physical significance, unless we are dealing with a theory of gravity. Apnys is 
measured to be very small, and it acts as a cosmological constant. 
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2. Ee. =1, Ey = 0,1,2 and E, = 0, LE, = 1: These terms are all zero because of Lorentz invariance. 
3. E. = 0, Ey = 2, D = 2: This is the photon self-energy, 
TH’ (q) = 7 WY~ ve, (9) 
qd 
Expanding this term, we get 
He” (q) = ern” + caq?nt” + c3q'q’ + finite terms. (10) 
The Ward identity imposes 
ml” (q) = @ Pr’ (a) (¢"), (11) 
and so we have 
II(q”) = C + finite terms (12) 
where C is the only divergent constant. The divergence in C can be absorbed in the counterterm vertex, 
rKYY = (23 — LEY Fv. (13) 
Pp 
4. E. = 0, Ey = 3, D=1: This is zero by charge conjugation invariance, as will be shown on the problem 
set. 
5. E. =0, Ey = 4, D =0: This is the four-point vertex, 
Myypo = 
= K(uvNoo — Nuptvo + Nuc lve) + finite terms, (14) 
where K is a potentially divergent constant, and the form of the divergent term follows from symmetry. 
The Ward identity constrains that 
a Mino =, (15) 
and so K = 0, and the four-point vertex is convergent. 
6. E. = 2, Ey = 0, D =1: This is the fermion self-energy, 
(ik) = WY 
= A+ Bk + finite terms. (16) 
Here, A and B are divergent constants, and they can be cancelled by the counterterm vertex, 
k 
with the mass renormalization dm and the field strength renormalization Z2 chosen appropriately. 
Cs Ee = 2, Ey =1, D=0: This is the vertex function, 


I“(k, k’) = Ay" + finite terms. (18) 
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where A is a logarithmically divergent coefficient. This can be cancelled by the counterterm three-point 
vertex, 


= —ZeAypy'y, (19) 


where the Ward identity imposes 2, = Zo. 


In summary, the counter terms we considered before are enough to cancel all of the superficial divergences to all 
orders in perturbation theory. We still need to consider diagrams with divergent sub-diagrams. Generically, these 
can be easily cancelled since divergent sub-graphs must also belong to those considered above. For example, the 


pairings 
soe + Sa Jae + Ja (20) 


are both finite. We now need to consider the case of overlapping divergences. That is, when two divergent subgraphs 
share an internal line. These divergences are more difficult to deal with. For example, consider the diagram 


(21) 


(22) 


gives a vertex correction at the vertex 1, leaving a residual self-energy divergence for the case p'’ —> oo. Similarly, 
if p’ is taken to be large, the pairing 


(23) 


gives a vertex correction at the vertex v, leaving a residual self-energy divergence for the case p’ —> oo. The two 
procedures are not independent, and it is finally necessary to add a correction of the form 
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at the order O(e*) to cancel the divergences when both p and p’ are large. Another example is given by the 
combination of diagrams 


p p 
c eee eee A animes ae Eee a (25) 
Pp Pp 


It can be proved to all orders, the content of the Bogoliubov-Parasiuk-Hepp-Zimermann theorem, that overlapping 
divergences can be canceled by counterterms corresponding to superficial divergences. 
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The procedure described for quantum electrodynamics last lecture can be readily generalized to other theories: 
D=d—-)_ EsAs — > Vii, (1) 
f i 


with Ay > 0. For a renormalizable theory, all 6; > 0. That is, all the interactions are renormalizable. This means 
only for a finite set of classes of diagrams characterized by {E;} is D > 0, and D is always bounded by d. 
Therefore, all superficial divergences can be canceled be a finite set of counterterms of the form 


D 
S* A, [] 67 (2) 
j=0 f 


where the A; are divergents constants. From the facts that Ay > 0 and D > 0, this set of interactions are all 
renormalizable: [A;] > 0. Therefore, all divergences can be absorbed into redefinitions of the couplings, masses, 
and fields, provided the Lagrangian contains all the renormalizable interactions allowed by symmetries, in order to 
make sure the overlapping divergences cancel. 


Remarks: 

ds For quantum electrodynamics, there is no need to enumerate all the diagrams. We only need to write 
down all possible counterterms consistent with gauge symmetry. 

2. Non-renormalizable theories, with some 6; < 0, require an infinite number of counterterms. 

3. In a super-renormalizable theory, each counterterm contains only a finite number of divergent terms, 
when expanded in perturbation theory. 

4. Generally any divergence allowed by symmetries will occur. 

5. If the regulator for ultraviolet divergences does not violate a given symmetry, the divergences violating 
that symmetry will not occur. This is particularly important and subtle for gauge symmetries, because of 
the need of gauge fixing. For quantum electrodynamics, the Ward identities ensure this. For non-Abelian 
gauge theories, the generalized Ward identities arising from the BRST symmetry ensure this. 

6. There are situations in which one cannot find any regulators which preserve a symmetry, and so the 


symmetry can be violated when the regulator is removed. This leads to the concept of an anomaly: a 
symmetry of the classical Lagrangian which is not respected in quantum theory. If this happens to a 
gauge symmetry, it leads to an inconsistency. 


4.2.3: Ultraviolet Divergences and Renormalization in Coordinate Space 


Our discussion has been so far restricted to momentum space, with loop momenta as p —> ov, leading to ultraviolet 
divergences. One would normally expect from that uncertainty principle that ultraviolet divergences are associated 
with short-distance phenomena. This can be made manifest by calculating the Feynman diagrams in coordinate 
space, leading to the result that counterterms must be local. This also leads to the result that in systems at a finite 
temperature 7’, the counterterms are independent of T. We will not give a complete treatment, but we will briefly 
highlight the local nature of the counterterms. We will use a scalar field for illustration. In the free theory, 


Dy =~} (06)? — me? (3) 
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If we try to compute operators like ¢?(), ¢3(a), etc., we find that they are not well-defined, containing ultraviolet 
divergences. For example, consider 

o(x)O(y) = G(x — y)+ : ox) oly) : (4) 
where the first term is the contraction and the second term is the normal ordering. As « —> y, the first term is 
singular: G(a — y) ~ x. The second term is regular in this limit. Hence, 


ahr 
o°(x) =Col+: ¢7(2) :, (5) 


where Cy ~ =z, for a short-distance regulator e. Similarly, 


P(x) = Col + C1: P(x): +: 94 (x):. (6) 
In the interacting theory, we consider, for example, 
Lint = A$", (7) 
or more generally 
Lint = dO, (8) 


where O is some composite operator built from ¢. The general physical amplitude is given, in the interaction 
picture, by 


(T(X)) = (T( Kei #220@)y) (9) 


0 


where X is a collection of fields (operators), and the right-hand side is evaluated in the free theory. At the nth 
order in perturbation theory, the contribution is 


ae [T]ata ir (XO(a1)...O(an))o- (10) 


There are two types of divergent loops. Considering, for example, O = ¢+ : 


YG, an 


The first type corresponds to a single propagator attached to the same vertex, and the second case corresponds to 
O(x)O(Y)|,4,+ The first type of divergence can be takencare of by normal ordering O itself: 


o*(x) =Co1+ C1: ¢ (2): +: ¢4(2):. (12) 


Here, the first term is canceled by Ag, and the second term is cancelled by a mass renormalization. The second 
type of divergence can also be dealt with using normal ordering: 


Oa) HW) Nay = It iP WU): +e 04)" +g OW) FZ PW) te. (03) 


Here, € = |x — y| is a short-distance regulator. The first four terms are not integrable, but can again be canceled 
by adding 


Ao + Aig? + Az¢* + Aa(O¢)? (14) 
counterterms to the Lagrangian. In general, 
Oa) POY) St ay = 2 Cnle) (15) 


where the C;,(e) ~ & are ordinary functions, and the : O,,(y) : are normal-ordered local operators. Here, 
aI — Ag, (16) 


where [O] = A and [O,] = A,. This expansion is known as the operator product expansion, or the OPE. The 
structure of the OPE is such that the counterterms are always local. The OPE will be discussed in more detail in 
the problem set. 
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5: THE RENORMALIZATION GROUP 


The renormalization program has been shown to be a success operationally: 


il For a renormalizable theory, all ultraviolet divergences can be absorbed into redefinitions of bare quan- 
tities, which are not observable. What is meant by the set of physical observables is unambiguous: it is 
well-defined in terms of a small number of physical couplings, masses and so forth. 


2. The requirement of renormalizability is a strong constraint on the physical theories we should con- 
sider. Theories satisfying this constraint included quantum electrodynamics, Yang-Mills theories and 
the standard model. 


Conceptually, we want to understand why it works. Many of the older generation of physicists did not accept 
renormalization when it was first introduced operationally. There are several difficulties with this: 


1. The process is perturbative in nature: it has to be carried out order-by-order in perturbation theory. 
Also, when carrying out perturbation theory, with a Lagrangian of the form 
Pe. o Loe Gus 
= —5 (06) — smd? — 268 — Lu, (1) 


for example, with YZ, = —4A (a6)? — 3Bm?¢? — 4C¢%. The coefficients A, B and C are found to be 
divergent, yet they are considered to be small quantities in the perturbation expansion. 


2. Non-renormalizable terms are generated by loop corrections, such as the anomalous magnetic moment 
term, py” WFi,,. Physically, we want to understand why this is not a bare term. 


3. Conceptually, it is troublesome to assume that a particular theory like quantum electrodynamics should 
be valid to arbitrarily high energies. 


5.1: WILSON’S APPROACH TO FIELD THEORIES 


Let us consider Wilson’s basic starting points, which should be viewed as the postulates of the theory. 


1. A quantum field theory should be considered as an effective field theory, valid only in a certain range 
of energies. In particular, it should be defined with an ultraviolet cut-off Ag. For example, in quantum 
electrodynamics, we should choose Ag ~ 1TeV, and we should only expect the theory to be valid for 


E <« No. 
2. Physical phenomena at a certain energy scale E’ are most appropriately described by the degrees of 
freedom at that scale. For example, for a scalar field 


o(x) + ok), k € (0,00), (2) 
the most relevant degrees of freedom should be $(k ~ E). 


At the moment, these notions have only been expressed philosophically. The key is to actually implement these ideas 
procedurally. We will use the example of a single real scalar field for illustration, working in Euclidean signature. 
The simplest cut-off procedure is to restrict 

Ik < Ao, (3) 


though other cut-off procedures can be considered. The basics ingredients of Wilson’s approach are as followed: 
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Start with a bare Lagrangian defined at a scale Ao : 


1 2,1 9,2 Ge’ ga we 
L [Ao] = 5 (89) + 5m a? ae a 


= Lo + Lint, 


with Y = $ (04), Ln = Stroh” Ox On = 10”. Here, ¥# [Ag] is only valid below a cut-off scale Ao. 
The bare couplings { gt encode the unknown physics above Ao. [Ao] includes all the interactions 


consistent with Lorentz symmetry and an additional discrete symmetry under ¢ 4 —d. That is, {O;} 
is a complete set of local operators consistent with a Zz symmetry. If 


[Oi] = Ai, (4) 


then 
EQ =d—A; = 6, (5) 


and so we introduce dimensionless couplings by 
AP = gf AG*. (6) 


We assume that ie ~ O(1) for all i, treating all couplings on an equal footing, including the non- 
renormalizable ones; those satisfying 6; < 0. 


The physics below the cut-off scale Ag is fully captured by the path integral, 
ZIF= f  Do(kyerSlal-seeve, (7) 
k<Ao 


where $ [Ao] = [ d¢zx @ [Ao]. The integration is only over $(k) with k < Ao. 


If we are interested in physics at the same energy scale E < Ag, we can integrate out degrees of freedom, 
for example $(k) for A < k < Ao, where A is a new cut-off scale, A ~ EF. More explicitly, we write 


(hk) = balk) + 4(h), a 
where ¢,(k) has support for k € (0, A), and b(k) has support for (A, Ao). Then, 
k<Ao k<A A<k<Ao 
and hence we have 
Zid] = ff Dan(aeftnite | Da(eyersleardind-seess 
k<A A<k<Ao 


Doa(k) eo Sléa,Al—f d*x 708, 
k<A 


l| 


where S [dq, A] is the effective action at the new cut-off scale A, for the degrees of freedom contained 
in da. S[da, A] encodes the effects of all the high energy degrees of freedom, ¢(k) with k < Ap on the 
degrees of freedom below the new scale A. It captures all of the essential physics at the scale k ~ A. 
S'[@a, A] is, of course, very complicated. We can parameterize it as 


S|éa, A] = s ‘a (11 wn) 5 (ky + ko +... +n) X Fa(B1,---sKn)Oa(K1)---Ga(kn)- (10) 
n=0 i=1 


Here, the 6-function is a consequence of translational symmetry. Expanding F,(k1,..., kn) as a power 
series of k;, we can rewrite 


L lon,A) = 52(06)? + 9:0 (11) 
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where O; is a complete set of local operators of ¢,, allowed by symmetries. Here, 
Z({9 Ao; A) (12) 
is the field renormalization from integrating out the degrees of freedom ¢, and 
{a({9\ + Aor ay} (13) 
are the physical couplings at the scale A. For example, 
Fo(k1, ke) = Fo(k; A) =cotcik? t+... (14) 


where we can write F2 a function of a single k < A because of the 6—function in (10). In coordinate 
space, this contribute to the effective action can be written as 


cod” + Cy (0¢)? 2 See (15) 


So, after changing scale, the Lagrangian 
Lia ayo (0) 
Lo [No] = (29) + dog Oi (16) 


is replaced by 
l 2 
£ [A] = 32 (00) + : GiO;3. (17) 


At first sight, Wilson’s approach seems very complicated: S$ [¢,, A], which is parameterized by an infinite number 


of interactions 
{ai({a?} oy Ay} ) (18) 
depends on an infinite number of bare couplings, 
{9} (19) 


at the scale Ap. To this extent, there is no predictive power. But it turns out there is some additional structure in 
this approach which makes it precisely the right way of thinking about field theories. First, a simple dimensional 
analysis can be used to give us some indication of this. We return to the bare action, 


S{do] = f atx (ae? + 90, 


So Si, 


I 


where g = NO AT 94, and we suppose that we are interested in a process at an energy scale EF. Then, if we assume 


all dimensional quantities are controlled by E; that is, 
kw E, @~E?, O;~ EX, (20) 
then we have 


Sin / Bag O, ~~ BAH4g) — BH g” 


fe, OS = 
a Ao ? 


where we have assumed that 0 ~ O(1), and we have that 6; = d— Aj. Then, for E < Ao, there are three cases: 
1. 6; > 0, or A; > d: irrelevant coupling, S; < 1 and the effect becomes negligibile for EF « Ao. 


2. 6; = 0, or A; = d: marginal coupling, S; ~ O(1). 
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3. 6; <0, or A; < d: relevant coupling, $; > 1. 


So, for E < Ag, the non-renormalizable (irrelevant) bare couplings become suppressed. In particular, in the limit 
- — 0, or equivalently, Ag —> oo, their contributions vanish. Therefore, the low energy physics only depends on 
a small number of renormalizable couplings, the marginal and relevant couplings, satisfying 6; > 0, even though in 
the bare Lagrangian, we took all couplings to appear at comparable strength. More explicitly, if we write 


{9} = {0} + {2} (21) 
(0) 


where {of } are the renormalizable and marginal couplings, and ae are the non-renormalizable couplings, then 
in the limit 4 — 0, 
Ko = a({ of} , No; A) 
_ al{ oh} , Ao; A). 


This crude dimensional analysis will be made precise in later lectures. 


Remarks: 

i In the Wilsonian approach, renormalization comes from integrating out the short-distance degrees of 
freedom. 

2: No ultraviolet divergences are ever encountered. 

3. There are underlying physical reasons for distinguishing renormalizable and non-renormalizable cou- 
plings. 
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5.1: RENORMALIZATION GROUP FLOW 


Consider the bare action defined at a scale Ag : 
ee ee 2 (0) 
S[dol= fate 5 (28) + 7 9.0,, (1) 


(0 


where O; is a complete set of local operators formed from ¢. The theory is specified by the set { 9; . As explained 


in the previous lecture, we can change the cutoff scale to some A < Ag by integrating out the degrees of freedom in 
the interval (A, Ao). This gives 


A 
Sta] = fo ate 5 (06)* + 0 gi(A)O. @) 


after redefining ¢ to absorb the field renormalization factor Z. This theory is specified by the set {g;(A)} . Similarly, 
at another scale A’ < A, we obtain S'[A’], described by {g;(A’)}. These three actions, S,,, Sa, and S,-, should 
all describe the same physics at an energy scale FE < A’ < A < Ag. The relations between them can be found by 
integrating out the degrees of freedom explicitly in the path integral, giving 

gi(A) = 99h”, Ao: A), 

0 
gi(A') = gi(gi, Ao; A’) 
Gi(gi(A), As A’). 

This process describes the renormalization group transformations, or the renormalization group flow: transforma- 


tions between couplings at different scales to ensure they describe the same low energy physics. If we consider, for 


8 


&(A,) 
&(A) 


&(A') 


g 


2 


Figure 1: The renormalization flow as the flow in the space of all possible coupling parameterizations to ensure the 
same low-energy physics at different scales. 
simplicity, the dimensionless couplings {\;(A)} defined by A; = g;A~*, differentiating gives 
dy; 
A—* = B({A;(A 
at = Bil{As(A)}) (3) 


where §; ({A;(A)}) = ag i({Aj(A)} , Z)|,_, - It is important to note that the 6; are only functions of the 
dimensionless coupling constants {A,;(A)}: they do not depend on A explicitly, as can be seen by considering 
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integrating out a fraction of the highest-energy modes in the path integral. The 6-functions give the tangent vector 
of the flow, and depend only on the values of {A;}. Under a relabeling of couplings, 


di = ({Ay}); (4) 
we have that x 
~ [x dd; 
B({X}) = a, OD. (5) 
The $—functions can be computed explicitly from the path integral: 


ZlJ] = i Do oe SiAbal—J a? Jb 
|k|<A 


= i Don (k) i Do(k) eS l¢at+¢,A]-S dx J(¢a+¢) 
|k| <A’ A’<|kI<A 


i Déar(k) eS lear A] -f dee Toys 
|k|<A 


I 


Now, if we let A’ —> A— 6A, S[A— dA] = S[A] + 6S [A], we have 


dS, 
AT = F(Sy) (6) 
Expanding 
Sa, = S- GO; = S- ri A* Oj, (7) 


(6) gives us the 8-functions for all couplings. As an example, let us consider the case of a free scalar field in four 
dimensions, with a cut-off at a scale A. Then, we have 


4 
si= | “Ermonmeawo. (8) 


nea (2)* 


We expand f(k) as a power series in k: 


f(k) = motk? +rykt+... 
F4(A 
= dAm(A)A? +k? 4 1) 4s bee, 
where the coefficient of k? can be chosen to one with a suitable normalization for ¢,. Here, Am(A), 74(A),..- 
are dimensionless couplings: [¢?] = 2, [(@0)"| = 6, and so bm = 2, 6, = —2, for example. We now let 
balk) = bar(k) + o(k) with ¢(k) supported for k € (A’, A) and @, supported for k € (0, A’). Then we have that 
7 d*k z 

Sp [oa] = Sa [on] + Sa [9] +2 font HOw OLR) (9) 


where the last term is zero as ga, and ¢x have disjoint support. Integrating out db only generates an overall constant 
for the path integral, and so 


svloal=Saloul =f S* savor our) (10) 


<a’ (277) 
where f(k) has not changed. That is, 


f(k) me +k? +rak* +... 


l| 


= he(ADAC ote ke? aie 
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and we conclude that 


A\2 A\ 7m 
Dt) So ag A) (=) = \m (A) (=) is a relevant operator, 
A\~2 A\ 7m 
Tae SFG’) (=) = 4 (A) (=) is an irrelevant operator. 
Similarly, 
Xm (A 
By. ge as, ) ee ee On) 
dM yn 
di-4(A’) 7 e 
Bry er = 274 ro —OmTa > 0. 
dM MA 


We note that dimensional quantities like m? and r4 do not change at all in this instance, but that the dimensionless 
couplings flow as they are defined with respect to the cut-off scale. This does reflect the right physics: the relative 
importance of each term in f(k) as we go to lower energies, or smaller k. That is, 
me 
7h becomes larger as k becomes smaller, 
rak* 


k2 


becomes smaller as k becomes smaller. 


We will now derive the full flow equation for S, [¢]. For this purpose, we write it as 


sio.nl=3 f os Gz(k)6(R)4(—k) + Sr [6, A] + U(A) (11) 


where U(A) is a cosmological constant, and the propagator G',(k) satisfies 


at k«aA 
Ga(k) =< ; 12 
a(k) f ko> A. om 
We have that 
Z= | Dole MA Selo, (13) 


where S; = S$; +U. There is now no need to impose an explicit cut-off when integrating over o(k). It is clearly 


K(k) 


A k 


Figure 2: The propagator Ga(k) = ~:«q(k) has a cut-off around k ~ A. 


very complicated to obtain the flow equation for Sr [¢, A] by evaluating the path integral directly. We will instead 
require 


dZ [A 
AGI Lo, (14) 
which is an equivalent statement. From this, we have 
1 d*k _& dG" _ _& 
a (2n)4 ( (bp ker 81) AT = (Anew **). (15) 
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Here, (...) = Zh [Do ...e75, with Z = [Dde~%°. We would like to express the left-hand side of (15) more 
directly in terms of $7. For this purpose, consider 


0= [>sx5 (#()e~S- 5") . (16) 


From this, we have that 


(2ny" 50) (e°5*) — Gq (o(h)O(—Re~*) + (ol EES!) =o. (17) 


The last term in this equation is still complicated. Consider further 


0= [96 MORNE (oS) , (18) 


From this, we have 


(2ny" 8G Rt (emS) — (G51)? (CR) He“) ~ 265" (OW erre™ ) + ( ibe =o, (a9) 


If we multiply 17 by 2G, and add the result to (19), we obtain 


-1/,-S1 -1)? -Sr\ 4 5° —S8;\ _ 
nyt aM OGR (eS) — (Gq)* (OlROL—NeS) + (Ge) =O. (20) 
Eliminating (o(k)o(—k)e~* ) between (15) and (20) gives 
ds -u\ . lf atk ,dGa °° fee 
(acre ° “) = tl (onyt aA Comes ° . 
4 dlogG 
—5 (amy 3(0) | ote at SF eee 


Here, the second term is a constant, and so we have 


d 1 d d*k 
= ] k 21 
nanU = DMdan | Gat OeCaW (21) 
where V4 = (27)* 5((0), and so 
1 d*k 
U(A) = Un + 5Vs f log Gale) (22) 
z (27) 
where Up is independent of A, and 
d 1 d*k . dGa(k) 6? = 
Fee —Syr ~ Sr 23 
no a) Galt aa ape i 
or, equivalently, 
d 5, = ff d*k | dGa(k) | OS; dS 62.97 (24) 
dA" 24 (an)f dA [.56(k) 54(—k) —-54(k)5G(—K) J” 
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Firstly, we will summarize our previous results. We start with a bare Lagrangian, 
L (Ao, 9] = Low gf é (1) 


The path integral 
Z= | Den {aLiAo] (2) 
k<Ao 


describes all the physics below the cutoff Ao. Most often we are interested in physics at some energy scale E < Ao. 
£ |Ao| is not convenient to use, as it contains the degrees of freedom ¢(k) : E < k < Ag which are not directly 
related to the physics at the scale E. However, we cannot simply discard them, as they have indirect effects, which 
can be taken into account by integrating them out. We write (k) = ¢a(k < A) +¢(A <k < Ao), and 


Zo = il Doa(k) / DG(k) e~ Sl?atFAo] — f — Dpq(k) ec Sloa-Al, (3) 
k<A A<k<Ag k<A 


where S[¢a, A] = f d?x >>, gi(A)O:, 9: = Gi ({9! 9; ve Ao; A) . By varying A, we obtain a continuous family of 
S [¢q, A] or a )}. This is the renormalization group flow. 


Ultraviolet 


Infrared 


E A Ae 


Figure 1: The renormalization flow from the cutoff Ag in the ultraviolet to a cutoff A to study processes at an 
energy scale F in the infrared region. 


Infinitesimally, we have 
dS, dr 
A = F(Sq), ASE = Bi (DG(A))), (4) 


where \; = g;A~%. The process means that all S, should describe the same low-energy physics. By dimensional 
analysis, we expect that for * <i, 


a (5) 


(A) ~ Asta) (3 
and so we have three cases: 
A; <d_ relevant, 
A; : <A; =d ~~ marginal, (6) 
A; >d __ irrelevant. 
We expect the initial values of irrelevant couplings should not be important for 4 x, — 0. This rough argument can 


be substantiated by analyzing the flow equation in detail. It turns out that the flow equation can be written in a 
closed form, as we showed in the last lecture: 


S [ba, A] = So [ba, A] + Sint [Ga, Al, (7) 
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K(k) 


A k 


Figure 2: The propagator Ga(k) = gz a(k) has a cutoff around k ~ A. 


where Sp = 5 f eke o(k)(—k)Gq'(k). If Ga = 7s, we have Sp = 4 [ d*x(0¢)?. We considered the case Ga = 


zzka(k), where & provides a cut-off at k ~ A. An example is a sharp cutoff K(k) = O(1— %). This means ¢(k) 
with k > A do not propagate, and there is no need to impose k < A explicitly in the path integral. Requiring the 
partition function to be independent of the choice of A led to the equation 


d Goes | d*k .dGa(k) [ 6S; 5S; 62.97 (8) 
dN”! 24 (anj' dA | dd(k) 5G(—k)— 5d(k)54(—K) 
Remarks: 
Li This equation is exact, and fully non-perturbative. 
2: Awa is only supported near a thin shell of momentum around A. In fact, for Ga = O(1 — *), we have 
Ata x 6(k — A). Physically, this reflects that the flow equation is obtained by integrating out the 
degrees of freedom around A. 
3. Expanding $7 [¢, A] in momentum space as 


Sr [o,A -¥4/ (te 


(8) requires that 


| (Qr)*5 (ky + ko +... + hn) x g(k1,---, kn; A)O(k1)... (kn). (9) 


1 d+ 
AS lbs. Bui) = do o-», 1 asa?) op, Ins A)— 5 / : AS Gla, @, ki, .- +, Rn A) 
{hi ,J2} 
(10) 
where p = omen k;, and I,, Ig are disjoint subsets of momenta such that I) U Ig = {ki,...,kn}. 
{1y,I2} 18 @ Sum over all possible ways to separate {ki,...,kn} into groups. Diagramatically, this is 


shown in figure 3. 


This corresponds to integrating out a tree-level diagram and a one-loop momentum diagram respectively. 


3. We can expand $7 [¢, A] in coordinate space: 


Sr [o,A |= fate D4) ); (11) 


where the O; form a complete set of local operators. From (8), we obtain 


dg; x; ~ 5 
AS = B39; + B?* 959%; (12) 
with 8 = 0. Using dimensionless couplings, \;(A) = g;(A)A7¢-42), we find 
aos 
B= 4 = ADAG + Be rgb, (13) 
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Figure 3: Diagramatic representation of (10), where the crossed vertex represents A224 (p). 


where B? = (A; -d) 62 + B? , and the B? has no diagonal term. We see how this corresponds to the 


cases 
A; >d_ damping, 
A;=0 marginal, (14) 
A; <d__ growth. 
4. The flow equation, (8), and thus the resulting G-functions, are not unique. It can be written in many 
other, equivalent forms by using field redefinitions: 


(x) —> d(x) + ag?(x) + b¢7 (x) + ¢(06)* +.... (15) 


Such field redefinitions lead to redefinitions of the couplings, but they should not change the physical 
observables. There is also a scheme dependence on the choice of cutoff functions, k,(k). The equations 
(12) and (13) are an infinite number of coupled first-order differential equations. It is quite complicated 
to analyze them, and often requires the development of approximation methods. 


Let us consider some general features of the flow: 


1. Separate the couplings, as before, as {A;} = {pa} + {ka}, where {pa} are the relevant and marginal 
couplings, and {«,} are the irrelevant couplings. Then, for a generic theory with all 0° ~ O(1) at Ao, 
we have for * < 1, assuming the {A;} do not become too large, that the {K.} only depend on the {pa}. 
For example, if we consider two couplings, \4 and Ag, for terms of the form ¢* and ¢° respectively, we 


have 
dr4 
AT — 6 + sey 
dn 


Ultraviolet 


Figure 4: Damping and renormalization flow into the point A, = Ag = O for the irrelevant coupling Ag and the 
marginally irrelevant coupling A.. 


as q 0, Ag + As = Aw(As) © ae The flow of A, is given by By = AMt = Me + O(A3), and so Aq is 


marginally irrelevant. 
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We can now make the connection to the standard renormalization procedure. We consider the ¢+-Lagrangian, 


1 1 1 5 
L = —5 (09) — smonyed? — GPG + La, (16) 


where m? and Aq are renormalized physical quantities which can be measure experimentally. They should be 


interpreted as being defined at a specific infrared scale. For example, 


NPY) — 4 (A = 0). (17) 


(phys) 


We now keep Aj and m fixed and take the limit of the cutoff Ag —> oo. All physical observables only 


Bee 
depend on the renormalized quantities. In particular, Oo (Ao) 4 0, 0) = 0. That is, we start along the horizontal 
axis. The Wilsonian approach tells us that such an initial condition is not important. Also note that Ag is not zero 
in the infrared, it is just determined by 4. Ag is related to six-particle scatterings, which of course have non-zero 
amplitude. 
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5.2.2: Fixed Points of Renormalization Flow 


What is the end point of a renormalization group flow as we take A —> 0? This is one of the most important 
physical questions. Let us consider the possibilities: 


1. Some couplings become very strong. This leads to the formation of bound states, and to new infrared 
degrees of freedom, as in quantum chromodynamics. 


2. {A,;} has a fixed point. That is, 6;({A;}) = 0, and so 


dys _ 
dA 


A 0, (1) 


meaning A;(A) = AF is a constant, independent of scale. At a fixed point, a theory is scale invariant. 
As a trivial example of this second case, we will consider a Gaussian fixed point. Let the Lagrangian be given by 
il 2 
£ = 5 (06), (2) 
describing a free, massless particle. In this case, S; = 0, and all of the couplings satisfy 4; = 0, and this is clearly 


invariant under the flow. In fact, our earlier discussion of RG flow and the notion of relevant and irrelevant couplings 
are defined implicitly with respect to this fixed point. 


L = Lo+ Lint; (3) 


where % = $ (06)? and Yint = >>; 9:0;. The dimensions A; of O; are found from the canonical dimension of ¢, 
which is defined using %. For a generic fixed point, A;(A) = A¥ constant, consider a small deviation from the fixed 


Ne 


2 
ss 


Figure 1: Flow diagram for the dimensionless couplings, A4 and Ag. A4 is marginally irrelevant. 


point, 
b= dy =O, (4) 
Then, 
ddd; . 
ATE = pi ({A}+53}) 
dp; 
= Oe ee 
2 i a 
= SOT O+... 
J 
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with Tj; = as . We now consider the left eigenvectors, ef”), of T’, with eigenvalues t,, where @ indexes the 
j r* 
eigenvectors. That is, 
Sob Ty = tae. (5) 
Now, we let 
Ue —_ > eb 5Xj. (6) 
Then dW 
Aa S75 2), 7 
WA Ug + O(U*) (7) 


Disregarding the non-linear terms, we suppose at some Ag that Ug = U2), Then we have that 


Ua(A) = (4\" yO, (8) 


U, is the scaling variable, and —t, = [Uq] is the scaling dimension, as defined with respect to the fixed point {A*}. 


We have that 
>0 relevant, 


—t, =[Ua.]=4=0 marginal, (9) 
<O irrelevant. 


L=Lo4 Lint, (10) 


Z = YAO:, 


Link = S- bA\O; — > U.Oa 


where O, are the scaling operators with respect to Y,. We have that [6a| = d-+t,. Now consider the case of Uj, 
Uz with t; <0 (relevant) and tz > 0 (irrelevant). Then 


Cats) am 


The critical surface is the submanifold in the space of couplings spanned by irrelevant couplings near a fixed 


Figure 2: Flow in the U,, Uz plane and the \j, Az plane about the fixed point X*. 


point. Any point on the critical surface will flow to the fixed point. A point lying off the critical surface will flow 
away from it, as shown in figure 3. The critical surface has a finite co-dimension, since the number of relevant and 
marginal couplings is always finite. 
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Figure 3: The critical surface submanifold. Couplings on the manifold are irrelevant and flow towards the fixed 
point. Couplings off the manifold flow away from it. 


Let us consider, at the ultraviolet scale, Aj, Y = % + Yint, with Y = 1 (09), and Ying = D0; AO; giving 
deviations from the fixed point. 


O; = ¢”, (d¢)” , ¢*, eae (12) 


are scaling operators with respect to %. In general near {A*}, the spectrum of scaling operators {O,.} and their 
conformal dimensions will be very different from those at the Gaussian fixed point. Such non-trivial fixed points 
are very rare at a finite distance away from the Gaussian fixed point, particularly in d > 3. For this reason, they 
provide a very limited theoretical tool. We consider one famous example: 


a 


= 5 (99) Ta 


oe" (13) 
at Ag in d= 4. Then we have 
By =o +..., (14) 


where c is a positive constant, giving a marginally irrelevant flow back to the Gaussian fixed point. If we now 
consider d = 4 — e, then 


d—2 € 
—- of -jes = 1 
= > D=« (15) 
for e < 1. Then we have 
By =—eX +07 +..., (16) 
and so, the fixed point is given by 
Mel <1, (17) 
c 


This fixed point is infinitesimally close to the Gaussian fixed point and can be studied using perturbation theory. 
We now conclude our discussion of the Wilsonian approach to renormalization. Our discussion has been more 
focused on the conceptual picture, rather than the explicity calculation. The Wilsonian approach is conceptually 
very appealing, but technically very awkward, as the couplings depend on the scale. We want to find how to see it 
from the conventional approach. 


5.3: BETA-FUNCTIONS FROM THE TRADITIONAL APPROACH 
We recall the procedure for renormalization, for example in ¢?—theory. 


1: Firstly, we have the Lagrangian 


L(¢oB,9B,MB) = L£(o,9,m) + Let, (18) 
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where A fl 
L = 5 (06) — 5g? + Eye? (19) 
2 2 6 
and 
ye = -5A(06)" - 5 Bm?e? + 5HACO, (20) 
2. We impose the following renormalization conditions to determine the counter terms: 
The) = 21 
w= 7Q- (2) 
I(k? = —m?)=0 (Physical mass condition) 
I’(k2, = —m?)=0 (Physical field condition) 
ka 
V(ki, ko, kz) (22) 
kg 
ky 
V(0,0,0) =g (Physical coupling condition) (23) 
3. All physical observables can be expressed in terms of m? and g. 


We consider more explicitly the one-loop expression for II(k?) and V (ky, ka, ks) : 


Be OT k? oe a Arp? 9 2 
TI(k*) = 5 (2 | 1) (5 tm) + ; dx D log a Ak* — Bm’, (24) 


where D= 2(1—2)k?+m?,a= 


g" 
az Gnyr and 


2 
-V(ki,k2,ks) = 1+ 5 F +k] SO; (25) 


where K = K(m?,k1,k2,k3,). The explicit form of this function is not important here. In order to cancel the 
divergences, we require 


a 

a= a 
Ge 

B= =" 48, 
€ 

Gs, Has 
€ 


where a, b and c are finite constants determined by the renormalization conditions. Therefore, II(k?) and V (ki, ka, k3) 
are independent of the auxillary scale ys and depend only on m? and g. But we can actually choose a, b and c 
arbitrarily, and physical observables will be finite. The previously stated renormalization conditions will generally 
not be satisfied, and so m?, ¢ and g are no longer the physical mass, field and couplings. They can be interpreted 
as parameters of the Lagrangian. The conditions given by 


TI(k? - —™ nys) = 0, 
IE (he = 155,53) = 0; 
V(O, 0, 0) = Jphyss 
and are now solved by general 
Myphys = Mophys (m?, 9); 
Pphys = Qphys (m7, 9, ¢), 
Gphys = YJphys (m?, 9), 
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which are all finite relations. For example, if we set 0 =a=b=c, we find 


a 
Spnys = 9 [L-+ 5K (m2, ns ki = 0), (26) 
where 
? Ary? 
k= dx, dx2dx3 O(a1 + %2+%3—- 1) log Sau (27) 
0 evD 
with : 
D=m+ xn3kt + x1 03ks + 21 2k3, (28) 
and so bee 
Ox Tl 
Gphys = 9 i + a log ( me )| ’ (29) 


where x is a numerical constant. A similar result holds for Te hye We note that, in general, the 4.—dependence does 
not disappear in such relations. In practice, we choose yu, and from Mpnys and gpnys, that is, from the measured 
quantities, we determine g and m and hence all other observables. The physical observables, of course, like Mpnys 
and gpnys do not depend on pi. So we have g(j) and m?(j.), and 


L(¢B,9B,MB) = £(6,9,m) + Let. (30) 


Here, the left-hand side is y-independent. The splitting on the right-hand side requires introducing a scale p. ¢, 
g and m? should be p—dependent, so that the physical observables are z—independent, but the finite part of the 
split is arbitrary. A different choice of a,b and c is a different choice of scheme: the one we have been using until 
now is the on-shell scheme. Different schemes give different g(j), 6(w) and m?(j). Again, the physics should not 
depend on such a choice, corresponding to the different ways of doing the splitting. 
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Let us consider some of the renormalization schemes we discussed in the previous lecture. A particularly 
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convenient renormalization scheme in dimensional regularization is minimal subtraction (MS). In this case, we 
take a= b=c=0. This gives 


ke : rp 
Tags (k?) = -5 [pom | ax Dios (JI. 


il 
g Math ko, k3) 


| 

—_ 
+ 
mw] & 
ey 
Q 
a 
o 
og 
fo 
o 

a 
ys, 
NS 


where f dF3 = tps dx, ie dae le dx36(a1 +2%2+ 23-1), D=2(1—2)k? +m? and 


D=m? + xox3ki + v143k5 + 122k. The k-dependence of g(j) and m() should be such that physical 


observables are independent of jz. In the MS scheme, we take 


giving 


1 ot i An? 
Oo 
€ e 2 g ey }? 


In the on-shell scheme, we have 


1 on 1 on Qphys  € on 
2 (oB,MB, 9B) = 525 (Odpnys)” _ 52m 2 pa Pais + gee Nee reie 


In the MS scheme, we have 


1 (us 1 gMS 
L(¢B,MB, 9B) = 525 (dus)? - 52m Miusbs a3 ee bis: 


Finally, in the MS scheme, we have 


1 GTS) 2 1, ars), 2 2 IMS. < 7(MS) 43 
£($a,mB, 9B) =—52y ~ (bus) — 52m Mazsbary + GME Z GN Oars: 


For the fields, we have 


¢B = Cae ee 


ll 
fo oS 
ak 
= 
Nicoagt 
NIK 
aS 
= 
w 


The three field renormalizations here are all divergent, but their ratios are finite. Why do we not just use the 
on-shell scheme? 


1. 
2. 


There are instances where it can’t be used, for example, if Mpnys = 0. 


Other schemes can be more convenient in certain settings. 
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3. More seriously, consider |k?| >> m?. Then we have 
Dw x(1—2)k’, (6) 
and - 2 
log Do = log — + (7) 
Hence, ‘ 
T(k?) & ois log (=) (8) 


which can be large compared with k?, and so perturbation theory is no longer a good approximation. 
Similarly, 


1 k? 
ree ka, ks) xl + a log wae 5 (9) 


and perturbation theory is not a good approximation for large k?. Introducing p allows us to address 
this problem: if we choose pz ~ k, no such logarithm arises. 


If we choose 4 appropriately, that is, to be comparable to the momentum scale of the physical process, we can 
improve our perturbation expansion. As we will see shortly, 


1. g(4s) and m(y) can be considered as the counterparts of the scale-dependent coupling constants of the 
Wilsonian approach. 


2. The reason we get large logarithmic terms in the on-shell scheme is that we are trying to use coupling 
defined at one scale to describe physics at very different scales. We will return to this point later with 
a physical explanation. 


Let us consider the structure of general correlation functions. Having looked at H(k?) and V(k,,k2,k3) at the 
one-loop level, let us now look at general connected Greens functions in some renormalization scheme, such as the 
MS scheme: 


Gn(@1,--+,%n) = (Q|T(O(41) -.- d(@n))|Q) , (10) 
where ¢(a) is the renormalized field. Then 
Gn({r} ,g(u),m(e); M) = Gn({x}, Ai(u); Hw) (11) 


where \;(2) are dimensionless coupling corresponding to g and m?, defined with respect to pz. For example, 
2 
Mic = me We consider also 


GY) (a1,...,2n) = (Q(T (bdn(21)--. dB (en)| 2). (12) 
Then 
GY) ({x},95,mp; Ao), (13) 


1 
where Ao is an ultraviolet cut-off, independent of yu. Since dg = Z % @, we have that 


GP) Ze Gr, (14) 
and so : 
—(Z?G,) =0. 15 
wt (23Gy) (15) 
Introducing 7 = Han log Z4, we have 
d 
(.2 + nm) G, = 0, (16) 
that is, 
O O 
(ug + 6.g- +0) G 0 (17) 
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where 6; = 4 ‘a ({A;: }) and 6; = 6; ({A;}). From the 
Callan- Syitancile equation, we can express G,,({x} , Ax(u 1); pt’) in terms of G.({2} A(t); “). First, consider 7 = 0. 
Then we have 


— Gp, = 0, (18) 


and so 

Gr({x} Ai’): Hw’) = Gn({r} , Ai); 1). (19) 
These are the running coupling constants we had earlier. In the case y 4 0, y does not depend on yu explicitly, but 
it does indirectly through the y = y (A; (jz)) . In this case, we have 


log py" 
Gin({x}, Ai(u’); w’) = exp -» [ d& y (Aj (E)) | X Gn({x}, Aue); H)- (20) 


Og 


y captures how the definition of @ changes as we change js. In the case that there is only one coupling g, that is, if 
m = 0, then since use =p, 


dg 
eee, 21 
S B (21) 
! ! ee dg’ ! 
Gn({x}, Ai(H’); W’) = exp |—n ary (9!) | X Gn({r} 5 Ai(H); #)- (22) 
log wu Big ) 
Let us consider some applications of this: 
1: For the high momentum behviour, consider 
Go (p,A(u'); w) = 97 (H', H)Go (p, Aa (H)s 1) (23) 
where 7—" = exp |- n ite dé y (A; (€))| . In particular, 
Go (Kp, Kp’): KU’) = 1" (KY, H)G2 (Kp, Ai(Kt); 1) - (24) 
This gives a one-dimensional group, 
1 Pp 
Go (p, Ai(H); MH) = Fe | —, Ail) (25) 
Pp ML 
and so 
Colne, slwu)inn) = ate (2.Aclou)) 
i 
= Go (p, Aik); 1). 
K 
Equivalently, we have 
2 
m? (KL, 
Ga (mp, Adu): p) = MM) 6 (p, a(n). (26) 
2: At a fixed point, 3; = 0, and so »; is a constant, so 7({A;}) is also a constant, and 
* + ny) Ga({2}) = 0 (27) 
a ny) Gn({x}) = 0. 
Consider, for example, G2(a; 1) = wu? f (wax). Then we have that 
d 
yo +2A) fly) =0, (28) 
y 
where A = 2+ y. Hence, 
c 
fy) = sx (29) 
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and F 
Go(0; #) = e- (30) 
More generally, for 
Gn {0} 1) =u? fr ({ua}) (31) 
where Ao is the canonical dimension, from the Callan-Symanzik equation, we have that f,, should satisfy 
fr ({AY}) = A" Fn LY) (32) 


with A = Ag +7. 


In summary, we have introduced the renormalization scale 4, and 2;(j4) are scale-dependent couplings, given by 
the renormalization group flow. Different choices of ~ correspond to different descriptions of physical observables. 
However, the physical observables themselves do not depend on the choice of pu. 
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5.3.2: Computation of Beta-Functions 


We consider the beta functions for the mass m and the coupling g: 


=P Bm = Sagan (1) 


2 
where A,, = ™ (u ). Note that the coupling constants in different renormalization schemes are generally different. 


In ais we fave 


{Ai}: scheme 1, 
{\;}: scheme 2. 


In the problem set, we will see how the 6—functions transform. In particular, the first two terms are universal. 


Example 1: g¢? in d= 6 with MS scheme 


Y= ~5 (068)? - grt + Fa B 
= -5(1+A) (06)? - 5m?( + Bd? + Luk + C9’, 
with 
op = (1+ A)? 6, 
fig AE Be 
gp = gi (1+ A)? (+0), 


A=—¢, B=-—% and C =—4%, up to O(a”). The key is to note that the bare quantities should be independent 


D 
a 


dmg pc 


=0 = 0. 2 
ae as (2) 


This leads to results for 6, and (,,. In general, in the case of dimensional regularization and minimal subtraction, 


gh”) = WO Jiu) + DPE Oy (3) 


where 6;(€) = 6; + aye: the last correction is due to dimensional regularization. From (2), we have 


Bile) (4) 


We can expand 

Bile) = Bi + €0% (5) 
where the first term is the 6—function and the second term again comes from dimensional regularization. If we take 
the —derivative of (3), we find 
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Equating both sides of the above equation order by order in €, we find 


(6; + aje) [i + eG” + mesa +... ] 
(0) 


OG; 
+ |B; tea; +e 1+ (6; +€a;)+e7...| = 0, 
On; 
and so, at O(e), 
a= —Ajaj, (6) 


(note that we are not invoking the summation convention here,) and, at O(e?), 


ag) 


ae Wale) S == 
Ori + aiG} + Bi + - Qa; OAy” = 0, (7) 
or, equivalently, 
i ag) 
Bi = 6:1 — iG 4 2 py (8) 


We note that the 3; are determined by simple-pole residuces of the counter-terms, and that at O(e~”) for n > 1, the 
constraints determine Cw for n > 2 in terms of GM, We now return to our discussion of the gf*—theory. Here, 


gm = ap=ays(l+A)"(14+B), 
5 
920 = mh =P rm (1-3 + hs 
where a = ae. and so we find 
di(e) =e, 6, =0, ay = 1, eu = —=a (9) 
and 
ay 8 
d2(€) 2; 69 2; ag 0, Gy => — Grime. (10) 


From this, we find for the 6—functions, 


Ba = ~0?—30? = a’, 
5 5 

m = —2Am — Ama = —2Am — =Am efi 

B r ra a r ra a+ 


Let us consider the physical implications of these equations. 


ale At weak coupling, a < 1, 8, is dominated by the first term, Bm * —2Am. This gives the dimension in 
the absence of the interaction, which implies the familiar behaviour 


2 
im 
A (4) = Am (oo) (BE) (11) 
Ll 
and so Am (uu) grows quadratically as we decrease wu. 


2. q@ is marginal in the absence of interactions, and so, interactions are important to determine the leading 
contribution. For gd?, 82 < 0, and the coupling is marginally relevant: a becomes stronger going into 
the infrared, as we decrease , and stronger going into the ultraviolet, as we increase w. 


We now integrate 


da 3 9 
= 12 
Pima oe, (12) 
which is equivalent to 
1 3 
d|—) = -=dl : 1 
(+) 5 Clog 1 (13) 


iw) 
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Suppose that a(19) = ao. Then we have 


1 1 3 
= — log a (14) 
alu) a 2 ” Ho 
and hence, 
a 
OP) = eae (15) 
1+ 55° log rr 
In particular, as ~p —> co, a(y) —> 0. This is asymptotic freedom. a(j) —> oo when 320 log a = —1. That 
il 
a(u) 


ae 


log(W) 


Figure 1: We find a linear relationship between a~1(j:) and log(z) with a positive gradient for the ¢° theory. 


2 
is, when « = poe **0 = A. We note that this discussion only applies to a(u) < 1. Of course, our one-loop 
approximation already breaks down before A is reached. Nevertheless, A provides a characteristic scale for the 
system. A is independent of jg. We can rewrite 


1 
log £’ 


(16) 


wl rw 


a(u) = 


and instead of specifying a(f19) = ao, we can simply specify A. The system does not have any dimensionless 
coupling. Rather, it has only a scale A. This is known as dimensional transmutation. 


Now let us go back to the issue of the large logarithms encountered in the on-shell scheme when k? >> m?. We 
encountered @ log #5 z at the one-loop level. However, it goes away if we choose pp ~ k. We want to understand why 
this is, and why we can still trust perturbation theory. To see what is happening, let us consider, for u~ k 


a(Ho = Mm) = a0, (17) 
so that 


ap) = eae ~ ao » (a log HN" . (18) 
These are exactly the logarithmic terms we have seen before. T - were just transferred to the relation between 
a(p~ k) and ag. The higher loop correction give higher powers in ao log £. As a perturbation series in ag, the last 
expression becomes bad when ag log & becomes large, but through the miracle of the renormalization group flow, 
by integrating the 8—function, we have essentially resummed this bad series as far as a(j1) remains small for all pu. 
This remarkable result is the essence of the renormalization group flow, which clearly also applies to the Wilsonian 
approach. 


HL au) 
wetsieete de tORGLe Saeaaaweds ones w+Au 
eer wv 

Mo a(t) 


Hours 2: a(n) and a(uo) are separated by large logarithms, but if we take infinitesimal steps, Aa = 
a? (p) log S# wea Hw Q? (u) 94, and so for a?(j) small, we can ignore the higher order terms. 
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5.3.3: Beta-Functions of Quantum Electrodynamics 


In the case of quantum electrodynamics, we have the Lagrangian 


1 — 
L = —FFE ES — itp (7! (Qu — ten AB) — mp) de, ) 


* - 3 =o s _ 2 
with we = ZS, A® = Z7 Ay, MB = m+ 6m, and eg = Z, *eu?2 or ag = Z3 taps, where a = 4: From our 


earlier result that Z3 = 1 — Zot, we have 


and 


Hence, the running of a(y) is given by 


or, equivalently, 
ao 


a(t) = (5) 


— 200 7 
eee log a 
In quantum electrodynamics, a(j:) increases as js is increased, and a(js) decreases as ys decreases. In particular, 


Zh 
a1) 


a 


Figure 1: We find a linear relationship between a~!(jz) and log() with a negative gradient for quantum electrody- 
namics. 


log() 


the Landau pole at a(j:) —> oo is given by 


p= A= poe*o, (6) 
independently of the choice of fo. Quantum electrodynamics becomes strongly coupled near the scale A. It is 
convenient to express a(j:) in terms of the physical apnys © 77 Which we measure. Consider 

1 e*(u) 
e() 1 e(u) x k2 1 — T(k2)" (7) 
At the one-loop level, 
2 1 
2 e“ (1) de, 
II(k*) = — =) [ dx x(1— x) (- 5 log aa (Z3 — 1), (8) 
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where 72 = ##“, D = m2? +.2(1—2)k? and m? = m2 + O(a), where me is the physical electron mass. It is 


ey ? 


convenient to introduce 


a(t) OB 
k= a : 9 
(K) 1—II(k?) 1-—TIIp(k?) (9) 
Note that this quantity is finite, although the numerator and denominator of the last term are divergent. @(k) is 
an effective k-dependent coupling. In particular, 


a 


m 1 
Ae = a(k = 0) y 137° (10) 
Now 
2 : D 
Ty79(k?) = af dx «(1 — x) log —, (11) 
a 0 H 
and so 
2 
2 e(u), Me _ 2a(p), Me 
pee, = = : 12 
Tgzg(¥? = 0) = “pig log 2 = SE tog (12) 
We therefore have (1) 
= OH 
Oe | — 2a) Jog me’ (13) 
37 bb 
We can compare this to 
a(t) 
a(u’) = —sa (14) 
1— 2atH) log © 
and we find 
Ae = A(ps = Me). (15) 
Therefore, in the MS scheme, 
Qe 
eg 16 
a) = mar og B (16) 
and the Landau pole occurs at A = mMee2ae = mee?*!37, We now consider @(k) = res : 
1. For k? > m?, D~ 2(1—2)k?, and so 
(U9) k? 
Tye(k?) = ~— log 5 +.... i 
ws(k*) ga ee ie + (17) 
Therefore, 
~ _ (Ho) 
A(k) = = eye (18) 
30 8 Ho 


and so &(k) © a(u) for uw ~ k. Note that this is scheme-independent: in any scheme for pp >> me, 
aly) © &(u). 
2. When k? < m?, we have @(k) © a, but a(w) —+ 0 as 4 —> 0. For pp < me, a(u) differs qualitatively 


m 


from the physical coupling. Physically, m_. becomes important, but that is not tracked by the MS or 


a(k) 


137 


-log(m,) -log(k) 


Figure 2: 4(k) as a function of log ¢. At the scale of k ~ me, we have &(me) ~ 7z- 
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MS schemes: a() is no different for a theory with m. = 0. It is more transparent to understand the 
behaviour of @(&) using the Wilsonian approach. The coupling in the Wilsonian action, by definition, 
should track &(k) closely. 


Below the scale of m_., the electron becomes heavy, and we can then integrate it out, leaving a pure 
Maxwell theory, in which the coupling constant does not run. 

3. For massless quantum electrodynamics, with m. = 0, a() is qualitatively correct for 4 —> 0. We then 
find that aes ¢(k) —> 0 as k —+ 0: The theory is marginally irrelevant. 

5.3.4: Beta-Function of Quantum Chromodynamics 


The Lagrangian of quantum chromodynamics, with an SU(N.) gauge group and Ny quarks, where N. = 3 and 
Ny; =6, is given by 


Ny 
1 : _ 
2 = -7T Fw FY iy 4; (yD — m5) 5, a2) 
j=l 
with 
Py = Op Ar = Op Ay — ig [Au A,| ’ 


Ay = Alt*, Fuv = Fii,t*, where t® are the generators of the fundamental representation of SU(N.). The covariant 
derivative is given by 
Dythj = Ours — 1gAyrhy. (20) 


We redefine A,, — Au, and so the Lagrangian becomes 


Ny 

1 bet 

a ag eee —4 7b; ("Dy — m3) vy, (21) 
j=l 


with 
Dythy = Ont; — tAph;- (22) 
We will outline the computation of the G—function for the pure gauge theory using the language of the Wilsonian 


approach, in the Euclidean case. Consider that the theory is provided with a cut off A, and bare coupling g = g(A). 


We now write : 
A, = AY) + A, (23) 


where A is the part below the scale A’, and An is the high-energy part, above the scale A’. Then we have 


Svar [Au] = Svar [AY] + 91 [AQ Ay] (24) 
and 
[PAs SH = foaesl#] [PAue® [4g 4y] 


=f vag se ]-25(00"] 
We take the derivative expansion of AS, 
A 4, 12 
AS = clog 7; aay, +..., (25) 
giving 


1 —. 


1 A 
g?(A’) — g2(A) + clog AY? (26) 
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where c is a pure g—independent number. A precise calculation gives c = any eNO! So, for the G—function, we 
find ' 
g 11 2 
a N Nr}. 27 
Po (47)? ( ao aig r) 27) 


Considering the fermionic sector, we have 


[2920 e SMe d(Dyr—myvt.. det (Dy: — m) 


= clos det(Dy;—m)_ 


If we define a, = x we find 


2/11 D 
es ( Ne Np) = ~bo? (28) 


as we found in the g¢? theory. So, we have 


1 1 LU 
ae = blog —, 29 
aan) ale) 8 ee) 
and hence (110) 
As 
as (tL) ul (30) 


© 1+ as(Ho)blog 


The Landau pole occurs at 


QED 


eu. 


log(u) 
Figure 3: a! scales linearly with log() with a positive gradient in quantum chromodynamics, as with the g¢* 
theory. 

A 

as(J19)blog =? = -1, (31) 
Ho 

and so ; 

Agop = poe FOO re 250MeV, (32) 


independently of our choice of fo. Finally, we put our coupling constant in the form 


1 


= 33 
blog Kegs (33) 


As 7) 


Near Agcp, quantum chromodynamics becomes strongly coupled. The form of this coupling leads to many inter- 
esting phenomena, including confinement, and chiral symmetry breaking: (Ur U, R) #0. 


8.324 — Fall 2005 
B. Zwiebach 


NOTES ON LIE ALGEBRAS 


Lie Algebra G . A vector space G with a bilinear operation | , |: G x G — G such that 
(i): [x, 2] = 0, for all x € G (antisymmetry), 
(ii) [x, [y, z]] + [y, [z, z]] + [z, [z, y]] = 0, for all x,y, z € G (Jacobi identity). 

Typically the vector space is over the real numbers (a real vector space) or over the complex 


numbers (a complex vector space). 
A Lie subalgebra of G is a vector subspace of G which is itself a Lie algebra under | , |. 


The generators 7, of G with a = 1,2,...,d are a set of basis vectors in G. Here d is 
the dimension of G. The Lie algebra is defined if we give the Lie brackets |Z), Tj] of all the 
generators. One writes 

[To, Te] = faw® Tes (1) 
where the structure constants f,,,° are real if the Lie algebra is a real vector space, or 


complex if the Lie algebra is a complex vector space. 


An ideal Z of G is an invariant subalgebra of G, namely, [G,Z| C Z . An ideal is proper 
if it is not equal to {0} nor to G (both of which are trivial ideals of G). The quotient space 
G/T is readily checked to be a Lie algebra. 


The derived algebra G of G is the set of all linear combinations of brackets of G. We 
write GY = [G,G]. G™ is an ideal of G. One defines GY) = [G,G®] for i > 1. In 
the derived series of ideals G,G),... each term is an ideal of G (this is proven using 


induction and the Jacobi identity). G is solvable if its derived series ends up with {0}. 


If Z and J are ideals of G then Z + JZ is also an ideal of G. If, in addition both Z and J are 
solvable, then Z + J is also solvable (show by induction that (I+ 7)™ cI™ + J. Since 
T is solvable, at some stage the derived series of (Z + 7) goes into the derived series of 7, 
which also terminates). 

The radical G,. of a Lie algebra G is the maximal solvable ideal of G, i.e. one enclosed in no 
larger solvable ideal. It follows from the above additivity property that G,. is unique. 

The center Z of G is the set of all elements of G that have zero bracket with all of G. The 


center of G is clearly an ideal of G. 


An abelian Lie algebra G is a Lie algebra whose derived algebra G‘"! = [G,G] vanishes (the 
Lie bracket of any two elements of G is always zero). For arbitrary G, the quotient G/G‘" is 
an abelian Lie algebra. There is a unique one-dimensional Lie algebra, the abelian algebra 
uz; with a single generator T and bracket [7,7] = 0. Any d dimensional abelian Lie algebra 


is (isomorphic to) the d-fold direct sum of one-dimensional Lie algebras. 


The direct sum G, 6G. @--- of Lie algebras with brackets [, ]1, |, ]2 --- is the sum of the 
vector spaces with a bracket |, | defined as: (i) [z, y] = [z, y]; when x,y € Gj, and (ii) 
[G:,G;| = 0, for i A 7. Each summand is an ideal of the direct sum. 


A Lie algebra G is simple if it has no proper ideals and is not abelian. Note the following 
obvious consequences: 
(i) the derived algebra of G (an ideal) must equal G ; G is not solvable, 


(ii) being not solvable and having no proper ideals, its radical G, vanishes, 
(iii) the center of G must vanish, 
( 


iv) G cannot be broken into two sets of commuting generators. 


A Lie algebra G is semisimple if its radical G, vanishes. Simple Lie algebras are semisimple. 


It can be shown that semisimple algebras are direct sums of simple Lie algebras. 


A reductive Lie algebra is the direct sum of an abelian algebra and a semisimple algebra, 
with both nonvanishing. This is the case of interest for non-abelian gauge theory. In these 


algebras the radical equals the center (the abelian algebra). 


Comments. The general Lie algebra G is either solvable or not solvable. The solvable algebras 
are not easy to classify. If the algebra G is not solvable then either the radical vanishes, in 
which case the algebra is semisimple, or the radical does not vanish, in which case the 
quotient (G/G,.) is semisimple (it can be shown that it has zero radical). Any Lie algebra G 
has a Levi decomposition G = P @, A, as the semidirect sum of a solvable algebra P and 
a semisimple algebra A. In the semidirect sum the bracket of elements within summands are 
the brackets of the respective algebras, and the bracket of mixed elements are defined using 
a representation o of A on the vector space of P. 

Example: The Poincare algebra P. It has two familir subalgebras spanning together P as 
a vector space; the Lorentz algebra A (semisimple) and the translation algebra P (abelian). 
The Poincare algebra is not solvable since A is not, is not semisimple since its radical equals 
the non-vanishing algebra P, and is not reductive since it is cannot be written as a direct 


sum of a semisimple and an abelian part. One has P = P , A. 
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THE CLASSICAL LIE ALGEBRAS 


Let V be a vector space over a field F' (R or C). The general linear algebra gl(V) is the 


algebra of endomorphisms (linear transformations, not necessarily invertible) of V. As a 
vector space over F’, End(V) has dimension (dim(V))?. The Lie bracket is just the commu- 
tator of the linear transformations. More concretely, when V is a vector space of dimension 
n over F’, one can think of gl(V) as the algebra gl(n, F’) of n x n matrices with entries in F’. 
The classical algebras fall into four families Ay, By, Cy and Dy. These are all subalgebras of 
gl(V). Any subalgebra of gl(V) is called a linear Lie algebra. 
Ay: Let dim V = ¢ +1, the algebra is called sl(¢+ 1, F’) (for special linear) and is that of 
endomorphisms of zero trace (a basis independent restriction). Its dimension is (¢+ 1)? — 1. 
To describe the next three families of algebras we will use bilinear forms f(v,w) on an 
n-dimensional vector space V: 

f(v,w) =v'sw , (2) 


where, for a chosen basis, s is a fixed invertible n x n matrix and v,w € V. We then consider 


V endomorphisms x such that for all v and w 
f(a(v),w) =—flv,2(w)) > se= a's, (3) 


We claim that the set of endomorphisms that satisfy (3) form a Lie algebra under commu- 
tation. Indeed, if x, and x2 satisfy f(x;(v),w) = —f(v, 2;(w)) so does |x, x2]. Additionally, 


| — _x' we deduce that x has zero trace. The Lie algebra in ques- 


tracing the relation sas~ 
tion is thus a subalgebra of sl(n, F’). The bilinear form f(v, w) is symmetric (antisymmetric) 
under the exchange of v and w if s is a symmetric (antisymmetric) matrix. 


C, : Let dim V = 2¢ and choose some specific basis in which 


0 Le 
= (4, i). 0 


The Lie algebra of endomorphisms that satisfy (3) is called sp(2¢, F’) (for symplectic). Also, 
sp(20, F) Cc sl(2é, F). For a general matrix x € sp(2¢, F’) we find that the @ x ¢ blocks take 


the form 
o=(” ') + gan p= im ==¢. (5) 


The dimension is readily found; we need two symmetric ¢ x ¢ matrices p and n giving ((€+1), 


and one arbitrary matrix m (that determines q) giving @?. Thus dim sp(2¢, F) = ¢(20+ 1). 
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B, : Let dim V = 2+ 1. This time 

0 0 

> ode M3 (6) 
Ip O 


The Lie algebra of V endomorphisms z that satisfy (3) is called o(2@+ 1, F’) (for orthogonal). 
Note that 0(2@+ 1, F) C sl(20+ 1, F). Moreover, dim 0(20+ 1, F) = €(2€4+ 1) (the same 
dimension as that of sp(2@, F)). 

D,(é > 2) : Let dim V = 2¢. This time 


0 Le 
=(f a (7) 


The Lie algebra of V endomorphisms x that satisfy (3) is called 0(20, F). Note that 0(2@, F’) Cc 
sl(2@, F) and that dim 0(22, F’) = €(2@ — 1). 

The above description given for the orthogonal algebras actually correspond to the maximal 
noncompact forms. The real orthogonal algebra o(f) is obtained with a real vector space V 


with dimV = @ and s = Jy. This is the algebra of real antisymmetric matrices 7 = —2*. 


COMPLEX AND REAL LIE ALGEBRAS 


The relevant issues are clarified with an example involving Lie algebras with three generators. 
Consider the Lie algebra sl(2,C’) of complex traceless 2 x 2 matrices. The Lie bracket is 
commutator (preserves tracelessness). This is naturally a vector space over the complex 
numbers as traceless matrices remain traceless by multiplication by a complex number. The 


algebra is therefore a complex Lie algebra. We can choose a basis of generators 


01 0 0 1 0 
n= (¢ a zt ae n=(t ak (8) 


The brackets are given 
[eaecarh | = J3, [ J3 , ghee = 2s: (9) 


It is possible to show this complex Lie algebra is the unique simple complex Lie algebra 
with three generators. Even though the algebra is complex we can easily get a real algebra 
since the brackets in (9) have only real numbers. We can declare the vector space to be real 


and say that the abstract basis vectors (J,, J_, J3) have the brackets in (9). This is now a 
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real Lie algebra. More concretely we can define the real algebra as the algebra sl(2, R) of 
traceless 2 x 2 real matrices (naturally a real vector space). The generators in (8) are good 
ones for this algebra. 


There is a way to construct another real algebra from s/(2,C), this time take 


; ps0" ah .{O -a 
2 = iy td.) = 41 a 25 = 1 = 1.= (| a 


(10) 
: _f1 0 
2 $3 = —iJ3 = 1 i >) : 
We recognize S, = —io,/2, with o, the (hermitian and traceless) Pauli matrices. The 
brackets have real structure constants 
[Ops | = Eee (11) 


Declaring the S; to be basis vectors of a real vector space we get the familiar simple real Lie 
algebra su(2), described as the algebra of 2 x 2 traceless antihermitian matrices. Although 
the matrices have complex entries, the vector space is naturally real — complex multiplication 
ruins antihermiticity. The real algebras sl(2, R) and su(2) are not isomorphic over the reals, 
they are the two real forms associated with s/(2,C). 

The real forms of sl(@+1,C) are sl(€+1, R) and su(€+1), defined as the algebra of traceless 
antihermitian (€+ 1) x (€+ 1) matrices (a compact subalgebra of sl(¢+ 1,C)). 

The real Lie algebra u(@) is defined as the algebra of ¢ x ¢ antihermitian (complex) matrices. 
It has real dimension ¢? (the associated Lie group is the unitary matrix group U(¢)). 

The real Lie algebra usp(2@) is the algebra of antihermitian matrices in sp(2@,C). Using (5) 


the antihermiticity gives 


(" +.) with m'=—m,n'=n. (12) 
The real dimension of usp(2¢) is (20+ 1). The algebras usp(2¢) and sp(2¢, R) are two real 
forms of sp(2@,C). Actually usp(2¢) is a compact real form while sp(2@, R) is a non-compact 


real form. 


REPRESENTATIONS 


A representation of a Lie algebra G on a vector space V is a linear map W: G — gl(V) 
which is a homomorphism of Lie-algebras (w([21, %2]) = [W(v1), Y(%2)], for all x1, 22 € G). 
It is an irreducible representation if the image of G in gl(V) acts on V without a proper 


invariant subspace. 


Schur’s lemma. Let w:G — gl(V) be an irreducible representation on a finite dimen- 
sional complex vector space V. Let a: V — V be a linear mapping commuting with ~(z) 
for all x € G. Then a is proportional to the identity map: a = al for some number a. Proof: 
a must have one eigenvector in V with some eigenvalue a since det(a — AJ) = 0 must have at 
least one (complex) root A = a). Then show that the set of vectors in V with a eigenvalue a 
form a nonvanishing invariant subspace. Conclude that the invariant subspace must be the 


whole V, so a(v) = av for any vector in V, which means a is a multiple of the identity.). 


The adjoint representation of the Lie algebra is a representation where the vector space in 
question is precisely the Lie algebra: V = G. We write ad : G — gl(G) and define ad x as 
the linear map 


ad a: a—layy|), vor: ada = apa]. (13) 


To verify it is a representation we must check that the linear maps satisfy 
[ad a,ad y] =ad [x,y]. (14) 


This is verified acting on an element of the algebra and using the Jacobi identity. 


The adjoint representation of a simple Lie algebra is irreducible for otherwise, by (13), the 
invariant subspace would be an ideal. For semisimple algebras the adjoint representation is 


reducible. 


The kernel of ad is formed by the elements x of the algebra that generate the zero map, and 
therefore must have zero brackets with everything. So the kernel of ad coincides with the 
center Z of the Lie algebra. If G is simple its center vanishes and the map ad : G — gl(G) 


is one to one; any simple Lie algebra G is isomorphic to a subalgebra of gl(G). 


Looking at the generators we have 
ie Ow Gre Ge aah Oe Ol aae ey ara (15) 
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To find the explicit matrix form of the adjoint action we view TJ), as the column vector with 
all entries equal to zero except for a 1 in the b-th entry. Then ad T, (T;) = T.(ad T,,)., and 
we thus conclude that 

(ad Ta )eb = fas’ - (16) 


An arbitrary representation r of G of dimension d, is defined by d, x d, matrices t! that 
represent the generators: 
[tests |] — Sap te (17) 


On the Lie algebra there is a well defined symmetric bilinear form k (-, -) called the Killing 


metric and defined as (with o denoting composition of linear maps) 
K(z,y) = —Tr (adw oad u) ’ (18) 


K(x, y) is simply minus the trace of the operator |x, |y,-]]. This metric is uniquely defined, 
up to a multiplicative constant, by two properties: 
(i) invariance under automorphisms o of the Lie algebra (a : G — G is an automorphism if 


[x,y] = z implies that [o(x), o(y)| = a(z). o defines an element in gl(G).) 


(a(x), o(y)) = K(a,y) 5 (19) 
(ii) associativity, in the sense that 
K([e,y],2) = (2, [y, 21). (20) 


One readily verifies that (18) satisfies (19) (note that ado(r) = 0 0oadxoo) and (20). 


A fundamental result of Killing and Cartan is that a Lie algebra is semisimple if and only 
if its Killing form is nondegenerate. A bilinear form «(-,-) is degenerate if there is a 
non-zero vector x such that «(x,-) =0. A particular case is readily verified; algebras with 
abelian ideals (thus not semisimple) have degenerate Killing forms. To see this split the 
algebra generators into two groups : those in the abelian ideal J and those outside J. With 
x € I, K(x,y) = 0 for all y or, equivalently, the trace of the operator [z, |y,-]] vanishes. 
This is clear because acting on elements of J this operator gives zero, and acting on elements 


outside I it gives elements inside J. Thus « is degenerate. 


A Lie algebra is said to be compact semisimple if the Killing form is positive definite, 


that is (a,x) > 0 for x £0. The corresponding Lie group is a compact manifold. 
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While the product of elements x,y € G is not defined, having representations w(x), W(y) we 
can multiply them together and the Lie bracket becomes a commutator. This suggests the 
definition of the universal enveloping algebra U(G) associated with the Lie algebra G. It is 
the associative algebra spanned by monomials in the generators of G, (say, v7, y, 27, ry, yZ,*--) 
where monomials are identified upon use of the brackets as commutators (if [7,y] = z, we 


can say that in the enveloping algebra ry = yx + z). 


A representation of a Lie group G on a vector space V is a map G — GL(V) which is 
a homomorphism of Lie groups. Here the general linear group GL(V) is the group of 
invertible linear maps V — V. Concretely, a representation r of dimension d,. a Lie group G 
is defined by an invertible d, x d, matrix D"(g) for each g € G such that 


D"(g1)D" (g2) = D" (9192), for all 91,92 EG. (21) 


Given a Lie group G with Lie algebra G the adjoint representation of the group is a map 
Ad: G — Aut(G) that associates to each element g € G an invertible linear transformation 


Ad g which is an automorphism of the Lie algebra G: 
Adg[a, y] = [Adg(x), Adg(y)]. (22) 
It follows immediately from this and (19) that the Killing form is Ad-invariant: 


«(Ad g(x), Adg(y)) =K(2,y). (23) 


As befits a representation, one must have 


Ad (gig2) = Ad (gi) Ad (92) . (24) 


The representation can be described concretely as (with repeated indices summed) 
Adg: T;-—- Ada (T,) = Th Diala); (25) 


where Dj(g) denotes dim(G)xdim(G) matrix representation of Adg. One readily checks 
that the above two equations imply that, as expected, Day(g1g2) = Dac(91)Des(g2)- 


Geometrically one understands the adjoint group action on the algebra as follows. Recall 


that the Lie algebra G associated with a Lie group G can be identified with the tangent space 


1 


of G at the identity. For any fixed g € G the transformation h — ghg~* (conjugation by g) 
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is a map of G to itself leaving the identity element fixed and inducing a linear transformation 
from the tangent space at h to the tangent space at ghg~!. Therefore, this map induces a 
linear transformation on the tangent space at the identity, a linear transformation on the 
Lie algebra!. 

The adjoint action can be described using any G representation and a G representation of 
the same dimensionality. Let t” be matrices representing the generators 7), in a representation 
r of G and let D"(g) be matrices of the same dimension in the representation r of G. The 


action of Ad g is via conjugation with D"(g), and (25) gives 
D"(g)t, D'(g)' = te Dralg) - (26) 


If the representation r of G is the adjoint, D"(g) = D(g). Note that the Dyq(g) on the above 


right-hand side are numbers, not matrices. 


For a matrix group G whose elements V € G are k x k matrices, the generators T, of 
the associated Lie algebra G are themselves k x k matrices (this is called the fundamental 
representation) and (25) and (26) read 


AdV(T,) =V T,V7! =T3 Dia(V). (27) 


COMPACT SEMISIMPLE LIE ALGEBRAS 


Consider again the Killing metric 
Kab = K (Ty, T,) = —Tr(adT, oad T,) = —Tr(taty). (28) 


where t, denotes the matrix representation of the adjoint action (ad T,). By (16) we have 


(ta)eb = f°. The metric is real since the structure constants are real. We note that 


—Tr (fa, &] fe) = fox? Kee - (29) 


Cyclicity of the trace implies that the f,,° Kec is totally antisymmetric in a,b and c. 


'More precisely the Lie Algebra G associated with a group G is defined as the set of all left-invariant 
vector fields on the group manifold. The bracket is the Lie bracket of vector fields, viewed as differential 
operators. As a vector space, the set of left-invariant vector fields is isomorphic to the tangent space to the 
group at the identity (each tangent vector at the identity can be extended to a left-invariant vector field). 
By left-invariant one means invariant under diffeomorphisms of the group induced by left multiplication: 
g— ag, Vg EG. 


Since the matrix K,, is symmetric, real, and positive definite (G is compact semisimple)? 
there is an real orthogonal O matrix that diagonalizes k, namely Ok OT = D, with D a 
diagonal matrix with positive entries. We can define new generators T’ = O,-T.., and then 
we verify that 

—Tr( tats )=Dap. 
By a further (real) scaling all diagonal elements can be made equal to a constant denoted 


as C(G). In the resulting basis, which we call again T,,, we have 


AT (Et y= OG) be She (30) 


In this basis we write [Tu, 7b] = f.,° Tc. Since f,°Kec is totally antisymmetric, (30) implies 


that fi See = f,p° 18 totally antisymmetric. We thus define a totally antisymmetric symbol 


habe = Fie (31) 
and write 
oe T,| = Fake ie (ad ees fo = ae : (32) 


We have shown that for compact semisimple real Lie algebras there is a basis in which 
equations (30) and (32) hold. 


We now verify that the adjoint representation of G acts on G via real orthogonal matrices. 


For this we begin with (30), insert D matrices, and use (26) applied to the adjoint: 


—C(G)6ap = Tr(ta te) 


= Tr (D(g) f D-"(9)D(9) &D-"(9) ) 
: (33) 
seat ae a )Dyo(g) 
—C(G)Dealg o(9) 
showing that the matrix D(g) is indeed orthogonal (D7 D = 1). 
For arbitrary representations of the Lie algebra we define the matrix & (r) 
Kaa(r) = —Tr (tht). (34) 


Positivity also follows if the matrices in the adjoint representation of G are antihermitian: any x € G is 
represented in the adjoint by an X(= —X*) and k(z,x) = —Tr(XX) = Tr(XTX) > 0. 
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Inserting group matrices D"(g) as in (33) and using (26) we see that 


Kab(?) = Kear) Dealg)Da(g) = K(r) = D'(g) K(r)D(g). 


Since D(g) is orthogonal it follows that «(r) commutes with all matrices D(g). For simple 
algebras the adjoint representation of the corresponding group acts irreducibly. By Schur’s 


lemma this means that «(7) must be proportional to the identity. We thus define 


—Tr (#2) = C(r) bap. (35) 


In the chosen basis, the element T,7, (a summed!) of the enveloping algebra is called 
a Casimir operator. It commutes with all 7,’s and therefore with all elements of the 
enveloping algebra. Indeed, with repeated indices summed, 

[ele J, po La (Facto) + (fac dy) he = (feeb io Teen) tale = 0; (36) 
More geometrically, for any semisimple algebra «TT, is a Casimir, where the inverse «2? 


of the Killing metric kg, exists because of semisimplicity. 


Since 77,7), is a Casimir, in any irreducible representation r the Casimir matrix t” t” commutes 
with all the matrices t” representing generators. The Casimir matrix, by Schur’s lemma, must 


be a multiple of the identity in any irreducible representation: 
hte = Cat) dae (37) 


where d(r) denotes the dimension of the representation r. The constants C(r) and C3(r) are 


simply related; taking the trace of (37) and the contraction of (30) one finds 
C2(r) dim(r) = C(r) dim(G) . (38) 
For the adjoint representation (37) gives 


Teen Foca = Co(G) dan oe C2(G) = Toke fie (39) 


iin) ) 
and (38) implies that C2(G) = C(G). As defined, the constants C' and C, are basis dependent. 


Tt 


